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The purpose of this user's guide is to serve as a reference book for the 
TMS320C3x generation of digital signal processors that includes 
TMS320C30, TMS320C30-27, TMS320C30-40, TMS320C31, and 
TMS320C31 -27. Throughout the book, all references to the TMS320C30 ap- 
ply to the TMS320C30-27 and TMS320C30-40 as well, and TMS320C31 re- 
fers to TMS320C31 and TMS320C31-27, unless an exception is noted. This 
document provides information to assist managers and hardware/software en- 
gineers in application development. 



How to Use This Manual 

This document contains the following chapters: 

Chapter 1 Introduction 

A general description of the TMS320C30 and TMS320C31 , their key features 
(features differ), and typical applications. 

Chapter 2 Architectural Overview 

Functional block diagram. TMS320C3x design description, hardware compo- 
nents, and device operation. Instruction set summary. 

Chapter 3 CPU Registers, Memory, and Cache 

Description of the registers in the CPU register file. Memory maps provided 
and instruction cache architecture, algorithm, and control bits explained. 

Chapter 4 Data Formats and Floating-Point Operation 

Description of signed and unsigned integer and floating-point formats. Discus- 
sion of floating-point multiplication, addition, subtraction, normalization, 
rounding, and conversions. 

Chapter 5 Addressing 

Operation, encoding, and implementation of addressing modes. Format de- 
scriptions. System stack management. 

Chapter 6 Program Flow Control 

Software control of program flow with repeat modes and branching. Inter- 
locked operations. Reset and interrupts. 
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Chapter 7 External Bus Operation 

Description of primary and expansion interfaces. External interface timing dia- 
grams. Programmable wait-states and bank switching. 

Chapter 8 Peripherals 

Description of the DMA controller, timers, and serial ports. 

Chapter 9 Pipeline Operation 

Discussion of the pipeline of operations on the TMS320C3x. 

Chapter 10 Assembly Language Instructions 

Functional listing of instructions. Condition codes defined. Alphabetized indi- 
vidual instruction descriptions with examples. 

Chapter 11 Software Applications 

Software application examples for the use of various TMS320C3x instruction 
set features. 

Chapter 12 Hardware Applications 

Hardware design techniques and application examples for interfacing to me- 
mories, peripherals, or other microcomputers/microprocessors. 

Chapter 13 TMS320C3x Signal Description and Electrical Characteristics 

Pin locations, pin descriptions, dimensions, electrical characteristics, signal 
timing diagrams and characteristics. 

Appendix A Instruction Opcodes 

List of the opcode fields for all the TMS320C3x instructions. 

Appendix B Development Support/Part Order Information 

Listings of the hardware and software available to support the TMS320C3x de- 
vice. 

Appendix C Quality and Reliability 

Discussion of Texas Instruments quality and reliability criteria for evaluating 
performance. 

Appendix D Calculation of TMS320C30 Power Dissipation 

Information used to determine the power dissipation and the thermal manage- 
ment requirements for the TMS320C30. 

Appendix E SMJ320C30 Digital Signal Processor Data Sheet 

Data sheet for the SMJ320C30 digital signal processor. 

Appendix F Quick Reference Guide 

Over 30 pages of the most referenced tables and figures. 
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Style and Symbol Conventions 

This document uses the following conventions: 



□ Program listings, program examples, interactive displays, filenames, and 
symbol names are shown in a special font. Examples use a bold version 
of the special font for emphasis. Here is a sample program listing: 



0011 


0005 


0001 


.field 


1, 


2 


0012 


0005 


0003 


. field 


3, 


4 


0013 


0005 


0006 


.field 


6, 


3 


0014 


0006 




.even 







Q In syntax descriptions, the instruction, command, or directive is in a bold 
face font and parameters are in italics. Portions of a syntax that are in bold 
face should be entered as shown; portions of a syntax that are in italics 
describe the type of information that should be entered. Here is an exam- 
ple of a directive syntax: 

.asect "section name", address 

.asect is the directive. This directive has two parameters, indicated by sec- 
tion name and address. When you use .asect, the first parameter must be 
an actual section name, enclosed in double quotes; the second parameter 
must be an address. 

£1 Square brackets ( [ and ] ) identify an optional parameter. If you use an op- 
tional parameter, you specify the information within the brackets; you don't 
enter the brackets themselves. Here's an example of an instruction that 
has an optional parameter: 

LALK 1 6-bit constant [, shift] 

The LALK instruction has two parameters. The first parameter, 16-bit con- 
stant, is required. The second parameter, shift, is optional. As this syntax 
shows, if you use the optional second parameter, you must precede it with 
a comma. 

Square brackets are also used as part of the pathname specification for 
VMS pathnames; in this case, the brackets are actually part of the path- 
name (they are not optional). 
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Braces ( {and} ) indicate a list. The symbol | (read as or) separates items 
within the list. Here's an example of a list: 

{ * I *+ 1 *- } 

This provides three choices: *, *+, or *-. 

Unless the list is enclosed in square brackets, you must choose one item 
from the list. 

Some directives can have a varying number of parameters. For example, 
the .byte directive can have up to 100 parameters. The syntax for this di- 
rective is 

.byte valuei [,..., value n ] 

This syntax shows that .byte must have at least one value parameter, but 
you have the option of supplying additional value parameters separated by 
commas. 



Information About Cautions and Warnings 



This book may contain cautions and warnings. 



Qi A caution describes a situation that could potentially cause your system 
to behave unexpectedly. 



This is what a caution looks like. 



The information in a caution is provided for your information. Please read each 
caution carefully. 
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Chapter 1 

Introduction 



The TMS320C3x generation of digital signal processors (DSPs) are high-per- 
formance CMOS 32-bit floating-point devices in the TMS320 family of 
single-chip digital signal processors. Since 1 982, when the TMS3201 was in- 
troduced, the TMS320 family, with its powerful instruction sets, high-speed 
number-crunching capabilities, and innovative architectures, established itself 
as the industry standard and is ideal for DSP applications. 

The TMS320 family consists of five generations: TMS320C1x, TMS320C2x, 
TMS320C3x, TMS320C4x, and TMS320C5x (see Figure 1 -1 ). The expansion 
includes enhancements of earlier generations and more powerful new genera- 
tions of digital signal processors. 
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Figure 1-1. TMS320 Device Evolution 
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The 50-ns cycle time of the TMS320C30-40 allows it to execute operations at 
a performance rate (up to 40 MFLOPS and 20 MIPS) previously available only 
on a supercomputer. The generation's performance is further enhanced 
through its large on-chip memories, concurrent DMA controller, two external 
interface ports, and instruction cache. 

This chapter presents the following major topics: 

Q Processor General Description (Section 1.1 on page 1-3) 

Qi Key Features (TMS320C30— Section 1.2 on page 1-5, TMS320C31 — 
Section 1.3 on page 1-6) 

□ Typical Applications (Section 1.4 on page 1-7) 
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1.1 General Description 



The TMS320's internal busing and special digital signal processing (DSP) in- 
struction set have the speed and flexibility to execute up to 50 MFLOPS (million 
floating-point operations per second). The TMS320 family optimizes speed by 
implementing functions in hardware that other processors implement through 
software or microcode. This hardware-intensive approach provides power pre- 
viously unavailable on a single chip. 

The emphasis on total system cost has resulted in a less expensive processor 
that can be designed into systems currently using costly bit-slice processors. 
Also, cost/performance selection is provided by the different processors in the 
TMS320C3x line: 

Q TMS320C30: 60-ns single-cycle execution-time 

Ql TMS320C30-27: Lower-cost, 74-ns single-cycle execution time 

Ql TMS320C30-40: Higher speed, 50-ns single-cycle execution time 

Q TMS320C31 : Low cost, 60-ns single-cycle execution time 

Ql TMS320C31-27: Lower cost, 74-ns single-cycle execution time 

All of these processors are described in this user's guide. Essentially, their 
functionality is the same. However, electrical and timing characteristics vary 
(described in Chapter 13); part numbering information is found in Section B.4 
on page B-15. Throughout this book, TMS320C3x is used to refer to the 
TMS320C30 and TMS320C31 and all speed variations. TMS320C30 and 
TMS320C31 are used to refer to all speed variants of those processors where 
that is appropriate. Special references, such as TMS320C30-40, are used to 
note any specific exceptions. 

The TMS320C30 and TMS320C31 can perform parallel multiply and ALU op- 
erations on integer or floating-point data in a single cycle. The processor also 
possesses a general-purpose register file, program cache, dedicated auxiliary 
register arithmetic units (ARAL)), internal dual-access memories, one DMA 
channel supporting concurrent I/O, and a short machine-cycle time. High per- 
formance and ease of use are products of those features. 

General-purpose applications are greatly enhanced by the large address 
space, multiprocessor interface, internally and externally generated wait 
states, two external interface ports (one on the TMS320C31 ), two timers, two 
serial ports (one on the TMS320C31), and multiple interrupt structure. The 
TMS320C3x supports a wide variety of system applications from host proces- 
sor to dedicated coprocessor. 

High-level language is more easily implemented through a register-based ar- 
chitecture, large address space, powerful addressing modes, flexible instruc- 
tion set, and well-supported floating-point arithmetic. 

Figure 1-2 is a functional block diagram that shows the interrelationships be- 
tween the various TMS320C3x key components. 
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Figure 1-2. TMS320C3x Block Diagram 
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TMS320C30 Key Features 

1 .2 TMS320C30 Key Features 

Some key features of the TMS320C30 are listed below. 
Ql Performance 

1) TMS320C30 

B 60-ns single cycle instruction execution time 

B 33.3 MFLOPS (million floating-point operations per second) 

■ 1 6.7 MIPS (million instructions per second) 

2) TMS320C30-27 

74-ns single cycle instruction execution time 
m 27 MFLOPS 
B 13.5 MIPS 

3) TMS320C30-40 

b 50-ns single cycle instruction execution time 
M 40 MFLOPS 
B 20 MIPS 

Q One 4K x 32-bit single-cycle dual-access on-chip ROM block 

Ql Two 1 K x 32-bit single-cycle dual-access on-chip RAM blocks 

Q 64 x 32-bit instruction cache 

Q 32-bit instruction and data words, 24-bit addresses 

Q 40/32-bit floating-point/integer multiplier and ALU 

Q 32-bit barrel shifter 

Ql Eight extended-precision registers (accumulators) 

Ql Two address generators with eight auxiliary registers and two auxiliary 

register arithmetic units 
Ql On-chip direct memory access (DMA) controller for concurrent I/O and 

CPU operation 

Q Integer, floating-point, and logical operations 

Ql Two- and three-operand instructions 

Q Parallel ALU and multiplier instructions in a single cycle 

Ql Block repeat capability 

Ql Zero-overhead loops with single-cycle branches 

□l Conditional calls and returns 

Q Interlocked instructions for multiprocessing support 

□ Two 32-bit data buses (24- and 1 3-bit address) 

Ql Two serial ports to support 8/1 6/24/32-bit transfers 

□ Two 32-bit timers 

Ql Two -general-purpose external flags, four external interrupts 
Q 181-pin grid array (PGA) package; 1 -jam CMOS 
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1.3 TMS320C31 Key Features 



The TMS320C31 and TMS320C31-27 devices are low-cost 32-bit DSPs that 
offer the advantages of a floating-point processor and ease of use. These de- 
vices are object-code compatible with the TMS320C30. The devices are func- 
tionally equivalent and differ only in their respective electrical and timing char- 
acteristics. Chapter 13 describes these differences in detail. 

TMS320C31 features are identical to those of the TMS320C30 device, except 
that the TMS320C31 uses a subset of the TMS320C30's standard peripheral 
and memory interfaces — thus maintaining the TMS320C30 core CPU 
33-MFLOP performance while providing the cost advantages associated with 
132-pin plastic quad flat pack (PQFP) packaging. 

The TMS320C31-27 is a slower speed, pin and object-code compatible ver- 
sion of the TMS320C31 . It delivers 27 MFLOPS (million floating-point opera- 
tions per second) and runs at 27 MHz. The reduced speed allows you to realize 
an immediate system cost reduction by using slower off-chip memories and a 
lower cost processor. 

Some key features of the TMS320C31 , including those which differentiate it 
from the TMS320C30, are summarized as follows: 

Qi Performance 

■ TMS320C31 

■ 60-ns single-cycle instruction execution time 

■ 33.3 MFLOPS 

■ 1 6.7 MIPS (million instructions per second) 

■ TMS320C31-27 

■ 74-ns single-cycle instruction execution time 

■ 27 MFLOPS 

■ 13.5MIPS 

□ Flexible boot program loader 

Q One serial port to support 8/1 6/24/32-bit transfers 
Qi One 32-bit data bus (24-bit address) 

□ 1 32-pin plastic quad flat pack (PQFP) package, .8 urn CMOS 
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1.4 Typical Applications 



The TMS320 family's versatility, realtime performance, and multiple functions 
offer flexible design approaches in a variety of applications, which are shown 
in Table 1-1. 



Table 1-1. Typical Applications of the TMS320 Family 
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CAM 


MR Imaging 
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Chapter 2 

Architectural Overview 



The TMS320C3x architecture (shown in Figure 2-1) responds to system 
demands that are based on sophisticated arithmetic algorithms and which em- 
phasize both hardware and software solutions. High performance is achieved 
through the precision and wide dynamic range of the floating-point units, large 
on-chip memory, a high degree of parallelism, and the DMA controller. 

Major areas of discussion are listed below. 

Qi Central Processing Unit (CPU) (Section 2.1 on page 2-3) 

■ Floating-point/integer multiplier 

■ ALU for floating-point, integer, and logical operations 

■ 32-bit barrel shifter 

■ Internal buses (CPU1/CPU2 and REG1/REG2) 

■ Auxiliary register arithmetic units (ARAUs) 

■ CPU register file 

□ Memory Organization (Section 2.2 on page 2-9) 

■ RAM, ROM, and cache 

■ Memory maps 

■ Memory addressing modes 

■ Instruction set summary 

□i Internal Bus Operation (Section 2.3 on page 2-22) 
Qi External Bus Operation (Section 2.4 on page 2-23) 

□ Peripherals (Section 2.5 on page 2-24) 

■ Timers 

■ Serial ports 

□i Direct Memory Access (DMA) (Section 2.6 on page 2-26) 

□ System Integration (Section 2.7 on page 2-27) 
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Figure 2-1. TMS320C3x Block Diagram 
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Central Processing Unit (CPU) 



2.1 Central Processing Unit (CPU) 



The TMS320C3x has a register-based CPU architecture. The CPU consists 
of the following components: 

□i Floating-point/integer multiplier 

Qi ALU for performing arithmetic: floating-point, integer, and logical opera- 
tions 

Q 32-bit barrel shifter 

Q Internal buses (CPU1/CPU2 and REG1/REG2) 

Q Auxiliary register arithmetic units (ARAUs) 

□i CPU register file 

Figure 2-2 shows the various CPU components that are discussed in the 
succeeding subsections. 
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Figure 2-2. Central Processing Unit 
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2.1.1 Multiplier 



Central Processing Unit (CPU) 



The multiplier performs single-cycle multiplications on 24-bit integer and 32-bit 
floating-point values. The TMS320C3x implementation of floating-point arith- 
metic allows for floating-point operations at fixed-point speeds via a 50-ns in- 
struction cycle and a high degree of parallelism. To gain even higher through- 
put, you can use parallel instructions to perform a multiply and ALU operation 
in a single cycle. 

When the multiplier performs floating-point multiplication, the inputs are 32-bit 
floating-point numbers, and the result is a 40-bit floating-point number. When 
the multiplier performs integer multiplication, the input data is 24 bits and yields 
a 32-bit result. Refer to Chapter 4 for detailed information on data formats and 
floating-point operation. 



2.1 .2 Arithmetic Logic Unit (ALU) 



The ALU performs single-cycle operations on 32-bit integer, 32-bit logical, and 
40-bit floating-point data, including single-cycle integer and floating-point con- 
versions. Results of the ALU are always maintained in 32-bit integer or 40-bit 
floating-point formats. The barrel shifter is used to shift up to 32 bits left or right 
in a single cycle. Refer to Chapter 4 for detailed information on data formats 
and floating-point operation. 

Internal buses, CPU1/CPU2 and REG1/REG2, carry two operands from 
memory and two operands from the register file, thus allowing parallel multi- 
plies and adds/subtracts on four integer or floating-point operands in a single 
cycle. 



2.1.3 Auxiliary Register Arithmetic Units (ARAUs) 



Two auxiliary register arithmetic units (ARAUO and ARAU1 ) can generate two 
addresses in a single cycle. The ARAUs operate in parallel with the multiplier 
and ALU. They support addressing with displacements, index registers (IRO 
and IR1), and circular and bit-reversed addressing. Refer to Chapter 5 for a 
description of addressing modes. 



2.1.4 CPU Register File 



The TMS320C3x provides 28 registers in a multiport register file that is tightly 
coupled to the CPU. All of these registers can be operated upon by the multipli- 
er and ALU, and can be used as general-purpose registers. However, the reg- 
isters also have some special functions. For example, the eight extended-pre- 
cision registers are especially suited for maintaining extended-precision float- 
ing-point results. The eight auxiliary registers support a variety of indirect ad- 
dressing modes and can be used as general-purpose 32-bit integer and logical 
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registers. The remaining registers provide system functions such as address- 
ing, stack management, processor status, interrupts, and block repeat. Refer 
to Chapter 6 for detailed information and examples of stack management and 
register usage. 

The register names and assigned functions are listed in Table 2-1. Following 
the table, the function of each register or group of registers is briefly described. 
Refer to Chapter 3 for detailed information on each of the CPU registers. 



Table 2-1. CPU Registers 



Register 
Name 


Assigned Function 


RO 
R1 
R2 
R3 
R4 
R5 
R6 
R7 


Extended-precision register 
Extended-precision register 1 
Extended-precision register 2 
Extended-precision register 3 
Extended-precision register 4 
Extended-precision register 5 
Extended-precision register 6 
Extended-precision register 7 


ARO 
AR1 
AR2 
AR3 
AR4 
AR5 
AR6 
AR7 


Auxiliary register 
Auxiliary register 1 
Auxiliary register 2 
Auxiliary register 3 
Auxiliary register 4 
Auxiliary register 5 
Auxiliary register 6 
Auxiliary register 7 


DP 
IRO 
IR1 
BK 
SP 


Data-page pointer 
Index register 
Index register 1 
Block size 
System stack pointer 


ST 

IE 

IF 

IOF 


Status register 
CPU/DMA interrupt enable 
CPU interrupt flags 
I/O flags 


RS 
RE 
RC 


Repeat start address 
Repeat end address 
Repeat counter 


PC 


Program counter 



The extended-precision registers (R7— RO) are capable of storing and sup- 
porting operations on 32-bit integer and 40-bit floating-point numbers. Any in- 
struction that assumes the operands are floating-point numbers uses bits 
39 — 0. If the operands are either signed or unsigned integers, only bits 
31—0 are used; bits 39 —32 remain unchanged. This is true for all shift oper- 
ations. Refer to Chapter 4 for extended-precision register formats for float- 
ing-point and integer numbers. 

The 32-bit auxiliary registers (AR7 — ARO) can be accessed by the CPU and 

modified by the two Auxiliary Register Arithmetic Units (ARAUs). The primary 
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function of the auxiliary registers is the generation of 24-bit addresses. They 
can also be used as loop counters or as 32-bit general-purpose registers that 
can be modified by the multiplier and ALU. Refer to Chapter 5 for detailed infor- 
mation and examples of the use of auxiliary registers in addressing. 

The data page pointer (DP) is a 32-bit register. The eight LSBs of the data 
page pointer are used by the direct addressing mode as a pointer to the page 
of data being addressed. Data pages are 64 K words long with a total of 256 
pages. 

The 32-bit index registers (IRO, IR1) contain the value used by the Auxiliary 
Register Arithmetic Unit (ARAU) to compute an indexed address. Refer to 
Chapter 6 for examples of the use of index registers in addressing. 

The ARAU uses the 32-bit block size register (BK) in circular addressing to 
specify the data block size. 

The system stack pointer (SP) is a 32-bit register that contains the address 
of the top of the system stack. The SP always points to the last element pushed 
onto the stack. A push performs a preincrement, and a pop performs a post- 
decrement of the system stack pointer. The SP is manipulated by interrupts, 
traps, calls, returns, and the PUSH and POP instructions. Refer to Section 5.5 
for information about system stack management. 

The status register (ST) contains global information relating to the state of the 
CPU. Typically, operations set the condition flags of the status register accord- 
ing to whether the result is zero, negative, etc. This includes register load and 
store operations as well as arithmetic and logical functions. When the status 
register is loaded, however, a bit-for-bit replacement is performed with the con- 
tents of the source operand, regardless of the state of any bits in the source 
operand. Therefore, following a load, the contents of the status register are 
identically equal to the contents of the source operand. This allows the status 
register to be easily saved and restored. See Table 3-2 for a list and definitions 
of the status register bits. 

The CPU/DMA interrupt enable register (IE) is a 32-bit register. The CPU in- 
terrupt enable bits are in locations 10 — 0. The DMA interrupt enable bits are 
in locations 26 — 1 6. A 1 in a CPU/DMA interrupt enable register bit enables 
the corresponding interrupt. A disables the corresponding interrupt. Refer to 
subsection 3.1 .8 for bit definitions. 

The CPU interrupt flag register (IF) is also a 32-bit register (see subsection 
3.1 .9). A 1 in a CPU interrupt flag register bit indicates that the corresponding 
interrupt is set. A indicates that the corresponding interrupt is not set. 

The I/O flags register (IOF) controls the function of the dedicated external 
pins, XFO and XF1 . These pins may be configured for input or output and may 
also be read from and written to. See subsection 3.1 .10 for detailed informa- 
tion. 
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The repeat counter (RC) is a 32-bit register used to specify the number of 
times a block of code is to be repeated when performing a block repeat. When 
the processor is operating in the repeat mode, the 32-bit repeat start address 
register (RS) contains the starting address of the block of program memory 
to be repeated, and the 32-bit repeat end address register (RE) contains the 
ending address of the block to be repeated. 

The program counter (PC) is a 32-bit register containing the address of the 
next instruction to be fetched. Although the PC is not part of the CPU register 
file, it is a register that can be modified by instructions that modify the program 
flow. 
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2.2 Memory Organization 



The total memory space of the TMS320C3x is 1 6M (million) 32-bit words. Pro- 
gram, data, and I/O space are contained within this 1 6M-word address space, 
thus allowing tables, coefficients, program code, or data to be stored in either 
RAM or ROM. In this way, memory usage is maximized and memory space al- 
located as desired. 



2.2.1 RAM, ROM, and Cache 



Figure 2-3 shows how the memory is organized on the TMS320C3x. RAM 
blocks and 1 are each 1 K x 32 bits. The ROM block, available only on the 
TMS320C30, is 4K x 32 bits. Each RAM and ROM block is capable of support- 
ing two CPU accesses in a single cycle. The separate program buses, data 
buses, and DMA buses allow for parallel program fetches, data reads and 
writes, and DMA operations. For example: the CPU can access two data val- 
ues in one RAM block and perform an external program fetch in parallel with 
the DMA loading another RAM block, all within a single cycle. 

A 64 x 32-bit instruction cache is provided to store often repeated sections of 
code, thus greatly reducing the number of off-chip accesses necessary. This 
allows for code to be stored off-chip in slower, lower-cost memories. The exter- 
nal buses are also freed for use by the DMA, external memory fetches, or other 
devices in the system. 

Refer to Chapter 3 for detailed information about the memory and instruction 
cache. 
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Figure 2-3. Memory Organization 
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2.2.2 Memory Maps 



The memory map is dependent upon whether the processor is running in the 
microprocessor mode (MC/MP or MCBL/MP = 0) or the microcomputer mode 
(MC/MP or MCBL/MP = 1 ). The memory maps for these modes are similar (see 
Figure 2-4). Locations 800000h through 801 FFFh are mapped to the expan- 
sion bus . When this region, available only on the TMS320C30, is accessed, 
MSTRB is active. Locations 802000h through 803FFFh are reserved. Loca- 
tions 804000h through 805FFFh are mapped to the exp ansion bu s. When this 
region, available only on the TMS320C30, is accessed, IOSTRB is active. Lo- 
cations 806000h through 807FFFh are reserved. All of the memory-mapped 
peripheral registers are in locations 808000h through 8097FFh. In both modes, 
RAM block is located at addresses 809800h through 809BFFh, and RAM 
block 1 is located at addresses 809C00h through 809FFFh. Locations 
80A000 h through OFFFFFFh are accessed over the external memory port 
(STRB active). 

In microprocessor mode, the 4K on-chip ROM (TMS320C30) or bootloader 
(TMS320C31 ) is not mapped into the TMS320C3x memory map. Locations Oh 
through OBFh consist of interrupt vector, trap vector, an d reser ved locations, 
all of which are accessed over the external memory port (STRB active). Loca- 
tions OCOh through 7FFFFFh are also accessed over the external memory 
port. 

In microcomputer mode, the 4K on-chip ROM (TMS320C30) or bootloader 
(TMS320C31 ) is mapped into locations Oh through OFFFh. There are 1 92 loca- 
tions (Oh through OBFh) within this block for interrupt vectors, trap vectors, and 
a reserved space. Loc ations 1 000h through 7FFFFFh are accessed over the 
external memory port (STRB active). 

Section 3.2 describes the memory maps in greater detail. The peripheral bus 
map and the vector locations for reset, interrupts, and traps are also given. 



Be careful! Access to a reserved area produces unpredictable re- 
sults. 
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Figure 2-4. TMS320C30 Memory Maps 
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Figure 2-5. TMS320C31 Memory Maps 
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2.2.3 Memory Addressing Modes 



The TMS320C3x supports a base set of general-purpose instructions as well 
as arithmetic-intensive instructions that are particularly suited for digital signal 
processing and other numeric-intensive applications. Refer to Chapter 5 for 
detailed information on addressing. 

Five groups of addressing modes are provided on the TMS320C3x. Six types 
of addressing may be used within the groups, as shown in the following list: 

Q General addressing modes: 

■ Register. The operand is a CPU register. 

■ Short immediate. The operand is a 1 6-bit immediate value. 

■ Direct. The operand is the contents of a 24-bit address. 

■ Indirect. An auxiliary register indicates the address of the operand. 
Qi Three-operand addressing modes: 

■ Register. Same as for general addressing mode. 

■ Indirect. Same as for general addressing mode. 

Qi Parallel addressing modes: 

■ Register. The operand is an extended-precision register. 

■ Indirect. Same as for general addressing mode. 
Qi Long-immediate addressing mode. 

■ Long-immediate. The operand is a 24-bit immediate value. 

Qi Conditional branch addressing modes: 

■ Register. Same as for general addressing mode 

■ PC-relative. A signed 1 6-bit displacement is added to the PC. 



2.2.4 Instruction Set Summary 



Table 2-2 lists the TMS320C3x instruction set in alphabetical order. Each table 
entry shows the instruction mnemonic, description, and operation. Refer to 
Chapter 1 for a functional listing of the instructions and individual instruction 
descriptions. 
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Table 2-2. Instruction Set Summary 



Mnemonic 


Description 


Operation 


ABSF 


Absolute value of a floating-point number 


|src| -> Rn 


ABSI 


Absolute value of an integer 


|src| -> Dreg 


ADDC 


Add integers with carry 


src + Dreg + C — » Dreg 


ADDC3 


Add integers with carry (3-operand) 


srd + src2 + C — > Dreg 


ADDF 


Add floating-point values 


src + Rn — > Rn 


ADDF3 


Add floating-point values (3-operand) 


srd + src2 — > Rn 


ADDI 


Add integers 


src + Dreg — » Dreg 


ADDI3 


Add integers (3-operand) 


srd + src2 + — » Dreg 


AND 


Bitwise logical-AND 


Dreg AND src — > Dreg 


AND3 


Bitwise logical-AND (3-operand) 


srd AND src2 — > Dreg 


ANDN 


Bitwise logical-AND with complement 


Dreg AND src — > Dreg 


ANDN3 


Bitwise logical-ANDN (3-operand) 


srd AND irc2 — > Dreg 


ASH 


Arithmetic shift 


If count > 0: 

(Shifted Dreg left by count) — > Dreg 
Else: 

(Shifted Dreg right by |count|) — > Dreg 


ASH3 


Arithmetic shift (3-operand) 


If count > 0: 

(Shifted src left by count) — > Dreg 
Else: 

(Shifted src right by |count|) — > Dreg 


Bcond 


Branch conditionally (standard) 


If cond = true: 
If Csrc is a register, Csrc — > PC 
If Csrc is a value, Csrc + PC -> PC 

Else, PC + 1 -» PC 


BcondD 


Branch conditionally (delayed) 


If cond = true: 
If Csrc is a register, Csrc — » PC 
If Csrc is a value, Csrc + PC + 3 — > PC 

Else, PC + 1 ->PC 



LEGEND: 

src general addressing modes 

srd three-operand addressing modes 

src2 three-operand addressing modes 

Csrc conditional-branch addressing modes 

Sreg register address (any register) 

count shift value (general addressing modes) 

SP stack pointer 

GIE global interrupt enable register 

RM repeat mode bit 

TOS top of stack 



Dreg register address (any register) 

Rn register address (R7 — RO) 

Daddr destination memory address 

ARn auxiliary register n (AR7 — ARO) 

addr 24-bit immediate address (label) 

cond condition code (see Chapter 1 1 ) 

ST status register 

RE repeat interrupt register 

RS repeat start register 

PC program counter 

C carry bit 
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Table 2-2. Instruction Set Summary (Continued) 



Mnemonic 


Description 


Operation 


BR 


Branch unconditionally (standard) 


Value -> PC 


BRD- 


Branch unconditionally (delayed) 


Value — > PC 


CALL 


Call subroutine 


PC + 1 -> TOS 
Value -» PC 


CALLcond 


Call subroutine conditionally 


If cond = true: 
PC + 1 -> TOS 

If Csrc is a register, Csrc — > PC 
If Csrc is a value, Csrc + PC — » PC 

Else, PC + 1 -» PC 


CMPF 


Compare floating-point values 


Set flags on Rn - src 


CMPF3 


Compare floating-point values 
(3-operand) 


Set flags on srd - src2 


CMPI 


Compare integers 


Set flags on Dreg - src 


CMPI3 


Compare integers (3-operand) 


Set flags on srd - src2 


DBcond 


Decrement and branch conditionally 
(standard) 


ARn - 1 — > ARn 

If cond = true and ARn > 0: 
If Csrc is a register, Csrc — > PC 
If Csrc is a value, Csrc + PC + 1 — > PC 

Else, PC + 1 -> PC 


DBcondD 


Decrement and branch conditionally 
(delayed) 


ARn - 1 -» ARn 

If cond = true and ARn > 0: 
If Csrc is a register, Csrc — > PC 
If Csrc is a value, Csrc + PC + 3 — » PC 

Else, PC + 1 -» PC 


FIX 


Convert floating-point value to integer 


Fix (src) — > Dreg 


FLOAT 


Convert integer to floating-point value 


Float(src) — > Rn 


IACK 


Interrupt acknowledge 


Dummy read of src 

IACK toggled low, then high 


IDLE 


Idle until interrupt 


PC + 1 -> PC 

Idle until next interrupt 


LDE 


Load floating-point exponent 


src(exponent) — > Rn(exponent) 


LDF 


Load floating-point value 


src — > Rn 


LDFcond 


Load floating-point value conditionally 


If cond = true, src — > Rn 
Else, Rn is not changed 


LDFI 


Load floating-point value, interlocked 


Signal interlocked operation src — > Rn 


LDI 


Load integer 


src — > Dreg 


LD\cond 


Load integer conditionally . 


If cond = true, src — » Dreg 
Else, Dreg is not changed 
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Table 2-2. Instruction Set Summary (Continued) 



Mnemonic 


Description 


Operation 


LDII 


Load integer, interlocked 


Signal interlocked operation src — > Dreg 


LDM 


Load floating-point mantissa 


src (mantissa) — > Rn (mantissa) 


LSH 


Logical shift 


If count > 0: 

(Dreg left-shifted by count) — > Dreg 
Else: 

(Dreg right-shifted by |count|) — > Dreg 


LSH3 


Logical shift (3-operand) 


If count > 0: 

(src left-shifted by count) — > Dreg 
Else: 
(src right-shifted by |count|) —» Dreg 


MPYF 


Multiply floating-point values 


src X Rn — > Rn 


MPYF3 


Multiply floating-point value (3-operand) 


srd x src2 — > Rn 


MPYI 


Multiply integers 


src x Dreg — > Dreg 


MPYI3 


Multiply integers (3-operand) 


srd x src2 — > Dreg 


NEGB 


Negate integer with borrow 


- src - C -> Dreg 


NEGF 


Negate floating-point value 


- src — > Rn 


NEGI 


Negate integer 


- src — > Dreg 


NOP 


No operation 


Modify ARn if specified 


NORM 


Normalize floating-point value 


Normalize (src) — > Rn 


NOT 


Bitwise logical-complement 


src — > Dreg 


OR 


Bitwise logical-OR 


Dreg OR src — » Dreg 


OR3 


Bitwise logical-OR (3-operand) 


srd OR src2 — > Dreg 


POP 


Pop integer from stack 


*SP >Dreg 


POPF 


Pop floating-point value from stack 


*SP >Rn 


PUSH 


Push integer on stack 


Sreg — > *++ SP 



LEGEND: 

src general addressing modes 

srd three-operand addressing modes 

src2 three-operand addressing modes 

Csrc conditional-branch addressing modes 

Sreg register address (any register) 

count shift value (general addressing modes) 

SP stack pointer 

GlE global interrupt enable register 

RM repeat mode bit 

TOS top of stack 



Dreg register address (any register) 

Rn register address (R7 — R0) 

Daddr destination memory address 

ARn auxiliary register rr (AR7 — ARO) 

addr 24-bit immediate address (label) 

cond condition code (see Chapter 1 1 ) 

ST status register 

RE repeat interrupt register 

RS repeat start register 

PC program counter 

C carry bit 
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Table 2-2. Instruction Set Summary (Continued) 



Mnemonic 


Description 


Operation 


PUSHF 


Push floating-point value on stack 


Rn— > *++SP 


RETIcond 


Return from interrupt conditionally 


If cond = true or missing: 

*SP > PC 

1 -> ST (GIE) 
Else, continue 


RETScond 


Return from subroutine conditionally 


If cond = true or missing: 

*SP > PC 

Else, continue 


RND 


Round floating-point value 


Round (src) — > Rn 


ROL 


Rotate left 


Dreg rotated left 1 bit — > Dreg 


ROLC 


Rotate left through carry 


Dreg rotated left 1 bit through carry — > Dreg 


ROR 


Rotate right 


Dreg rotated right 1 bit — > Dreg 


RORC 


Rotate right through carry 


Dreg rotated right 1 bit through carry — > 
Dreg 


RPTB 


Repeat block of instructions 


src — > RE 
1 -> ST (RM) 
Next PC -> RS 


RPTS 


Repeat single instruction 


src — > RC 
1 -> ST (RM) 
Next PC -> RS 
Next PC -» RE 


SIGI 


Signal, interlocked 


Signal interlocked operation 
Wait for interlock acknowledge 
Clear interlock 


STF 


Store floating-point value 


Rn -> Daddr 


STFI 


Store floating-point value, interlocked 


Rn — > Daddr 

Signal end of interlocked operation 


STI 


Store integer 


Sreg -> Daddr 


STII 


Store integer, interlocked 


Sreg -> Daddr 

Signal end of interlocked operation 


SUBB 


Subtract integers with borrow 


Dreg - src - C — » Dreg 


SUBB3 


Subtract integers with borrow (3-operand) 


srd - src2 - C — > Dreg 


SUBC 


Subtract integers conditionally 


If Dreg - src > 0: 

[(Dreg - src) « 1] OR 1 -> Dreg 

Else, Dreg « 1 — » Dreg 
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Table 2-2. Instruction Set Summary (Concluded) 



Mnemonic 


Description 


Operation 


SUBF 


Subtract floating-point values 


Rn - src — » Rn 


SUBF3 


Subtract floating-point values (3-operand) 


srd - src2 — > Rn 


SUBI 


Subtract integers 


Dreg - src — > Dreg 


SUBI3 


Subtract integers (3-operand) 


srd - src2 —> Dreg 


SUBRB 


Subtract reverse integer with borrow 


src - Dreg - C — > Dreg 


SUBRF 


Subtract reverse floating-point value 


src - Rn — > Rn 


SUBRI 


Subtract reverse integer 


src - Dreg — > Dreg 


SWI 


Software interrupt 


Perform emulator interrupt sequence 


TRAPcond 


Trap conditionally 


If cond = true or missing: 

Next PC -> * ++ SP 

Trap vector N — » PC 

-> ST (GIE) 
Else, continue 


TSTB 


Test bit fields 


Dreg AND src 


TSTB3 


Test bit fields (3-operand) 


srd AND src2 


XOR 


Bitwise exclusive-OR 


Dreg XOR src -» Dreg 


XOR3 


Bitwise exclusive-OR (3-operand) 


srd XOR src2 -> Dreg 



LEGEND: 






src 


general addressing modes 


Dreg 


srd 


three-operand addressing modes 


Rn 


src2 


three-operand addressing modes 


Daddr 


Csrc 


conditional-branch addressing modes 


ARn 


Sreg 


register address (any register) 


addr 


count 


shift value (general addressing modes) 


cond 


SP 


stack pointer 


ST 


GIE 


global interrupt enable register 


RE 


RM 


repeat mode bit 


RS 


TOS 


top of stack 


PC 
C 



register address (any register) 
register address (R7 — RO) 
destination memory address 
auxiliary register n (AR7 — ARO) 
24-bit immediate address (label) 
condition code (see Chapter 11) 
status register 
repeat interrupt register 
repeat start register 
program counter 
carry bit 
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Table 2-3. Parallel Instruction Set Summary 



Mnemonic 


Description 


Operation 


Parallel Arithmetic With Store Instructions 


ABSF 

IISTF 


Absolute value of a floating-point 


|src2| — > dstl 
|| src3 — » dst2 


ABSI 
HSTI 


Absolute value of an integer 


|src2| — > dstl 
|| src3 — > dst2 


ADDF3 
||STF 


Add floating-point 


srd + src2 — » dstl 
|| src3 — > dst2 


ADDI3 
HSTI 


Add integer 


srd + src2 — > dstl 
|| src3 — > dst2 


AND3 
HSTI 


Bitwise logical-AND 


srd AND src2 -* dstl 
|| src3 -> dst2 


ASH3 
HSTI 


Arithmetic shift 


If count > 0: 

src2 « count — > dstl 
|| src3 — » dst2 
Else: 

src2 » |count| — > dstl 
|| src3 — > dst2 


FIX 

HSTI 


Convert floating-point to integer 


Fix(src2) -» dstl 
|| src3 -> dst2 


FLOAT 

||STF 


Convert integer to floating-point 


Float(src2) -> dstl - 
|| src3 — > dst2 


LDF 

||STF 


Load floating-point 


src2 — > dstl 
|| src3 — > dst2 


LDI 

iisti 


Load integer 


src2 — > dstl 
|| src3 -» dst2 


LSH3 
HSTI 


Logical shift 


If count > 0: 

src2 « count — > dstl 
|| src3 — > dst2 
Else: 

src2 » |count| —> dstl 
|| src3 -> dst2 


MPYF3 
HSTF 


Multiply floating-point 


srd x src2 — > dstl 
|| src3 — > dst2 


MPYI3 
HSTI 


Multiply integer 


srd x src2 — > dstl 
|| src3 -» dst2 


NEGF 
||STF 


Negate floating-point 


0- src2 -» dstl 
|| src3 — > dst2 
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Table 2-3. Parallel Instruction Set Summary (Concluded) 



Mnemonic 


Description 


Operation 


Parallel Arithmetic With Store Instructions (Concluded) 


NEGI 
||STI 


Negate integer 


- src2 — > dstl 
|| src3 -> dst2 


NOT 
||STI 


Complement 


ircT — > dstl 
|| src3 — > dst2 


OR3 
|{ ST! 


Bitwise logical-OR 


srd OR src2 — > dstl 
|| src3 — » dst2 


STF 
|| STF 


Store floating-point 


srd — > dstl 
|| src3-»dst2 


STI 
||STI 


Store integer 


srd — > dstl 
|| src3 — > dst2 


SUBF3 
|| STF 


Subtract floating-point 


srd - src2 — » dstl 
||src3— >dst2 


SUBI3 
II STI 


Subtract integer 


srd - src2 — > dstl 
|| src3 -> dst2 


XOR3 

II STI 


Bitwise exclusive-OR 


srd XOR src2 -> dstl 
|| src3 -» dst2 


Parallel Load Instructions 


LDF 
|| LDF 


Load floating-point 


src2 -» dstl 
|| src4 -» dst2 


LDI 
||LDI 


Load integer 


src2 — > dstl 
|| src4 — » dst2 


Parallel Multiply And Add/Subtract Instructions 


MPYF3 
|| ADDF3 


Multiply and add floating-point 


op1 x op2 — > op3 
|| op4 + op5 — > op6 


MPYF3 
|| SUBF3 


Multiply and subtract floating-point 


op1 x op2 — > op3 
||op4-op5— »op6 


MPYI3 
|| ADDI3 


Multiply and add integer 


op1 x op2 — » op3 
|| op4 + op5 — > op6 


MPYI3 
|| SUBI3 


Multiply and subtract integer 


op1 x op2 — > op3 
|| op4-op5— >op6 



LEGEND: 
srd 
src3 
dstl 
op3 



register addr(R7— RO) 
register addr (R7 — RO) 
register addr (R7 — RO) 
register addr (RO or R1 ) 



src2 indirect addr (disp = 0, 1 , IRO, IR1 ) 

src4 indirect addr (disp = 0, 1 , IRO, IR1 ) 

dst2 indirect addr (disp = 0, 1 , IRO, IR1 ) 

op6 register addr (R2 or R3) 



op1 ,op2,op4,op5 - Two of these operands must be specified using register addr, and two must be specified 
using indirect. 
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2.3 Internal Bus Operation 



A large portion of the TMS320C3x's high performance is due to internal busing 
and parallelism. The separate program buses (PADDR and PDATA), data 
buses (DADDR1, DADDR2, and DDATA), and DMA buses (DMAADDR and 
DMADATA) allow for parallel program fetches, data accesses, and DMA ac- 
cesses. These buses connect all of the physical spaces (on-chip memory, 
off-chip memory, and on-chip peripherals) supported by the TMS320C30. 
Figure 2-3 shows these internal buses and their connection to on-chip and off- 
chip memory blocks. 

The program counter (PC) is connected to the 24-bit program address bus 
(PADDR). The instruction register (IR) is connected to the 32-bit program data 
bus (PDATA). These buses can fetch a single instruction word every machine 
cycle. 

The 24-bit data address buses (DADDR1 and DADDR2) and the 32-bit data 
data bus (DDATA) support two data memory accesses every machine cycle. 
The DDATA bus carries data to the CPU over the CPU1 and CPU2 buses. The 
CPU1 and CPU2 buses can carry two data memory operands to the multiplier, 
ALU, and register file every machine cycle. Also internal to the CPU are regis- 
ter buses REG1 and REG2 that can carry two data values from the register file 
to the multiplier and ALU every machine cycle. Figure 2-2 shows the buses 
internal to the CPU section of the processor. 

The DMA controller is supported with a 24-bit address bus (DMAADDR) and 
a 32-bit data bus (DMADATA). These buses allow the DMA to perform memory 
accesses in parallel with the memory accesses occurring from the data and 
program buses. 
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2.4 External Bus Operation 



The TMS320C30 provides two external interfaces: the primary bus and the ex- 
pansion bus. The TMS320C31 provides one external interface: the primary 
bus. Both primary and expansion buses consist of a 32-bit data bus and a set 
of control signals. The primary bus has a 24-bit address bus, whereas the ex- 
pansion bus has a 1 3-bit address bus. Both buses can be used to address ex- 
ternal program/data memory or I/O space. The buses also have an external 
RDY signal for wait-state generation. Additional wait states may be inserted 
under software control. Refer to Chapter 7 for detailed information on external 
bus operation. 



2.4.1 External Interrupts 



The TMS320C3x supports four external interru pts (INT3 -INT0), a number of 
internal interrupts, and a nonmaskable external RESET signal. These can be 
used to interr upt eit her the DMA or the CPU. When the CPU responds to the 
interrupt, the IACK pin can be used to signal an e xternal i nterrupt acknowl- 
edge. Section 6.5 (beginning on page 6-16) cover RESET and interrupt pro- 
cessing. 



2.4.2 Interlocked-lnstruction Signaling 



Two external I/O flags, XFO and XF1 , can be configured as input or output pins 
under software control. These pins are also used by the interlocked operations 
of the TMS320C3x. The interlocked-operations instruction group supports 
multiprocessor communication (see Section 6.4 on page 6-10 for examples of 
the use of interlocked instructions). 
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2.5 Peripherals 



All TMS320C3x peripherals are controlled through memory-mapped registers 
on a dedicated peripheral bus. This peripheral bus is composed of a 32-bit data 
bus and a 24-bit address bus. This peripheral bus permits straightforward com- 
munication to the peripherals. The TMS320C3x peripherals include two timers 
and two serial ports (only one serial port is available on the TMS320C31). 
Figure 2-6 shows the peripherals with associated buses and signals. Refer to 
Chapter 8 for detailed information on the peripherals. 



Figure 2-6. Peripheral Modules 
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2.5.1 Timers 



2.5.2 Serial Ports 



Peripherals 



The two timer modules are general-purpose 32-bit timer/event counters with 
two signaling modes and internal or external clocking. Each timer has an I/O 
pin that can be used as an input clock to the timer or as an output signal driven 
by the timer. The pin may also be configured as a general-purpose I/O pin. 



The two bidirectional serial ports are totally independent. They are identical 
with a complementary set of control registers controlling each one. Each serial 
port can be configured to transfer 8, 1 6, 24, or 32 bits of data per word. The 
clock for each serial port can originate either internally or externally. An inter- 
nally generated divide-down clock is provided. The serial port pins are confi- 
gurable as general-purpose I/O pins. The serial ports can also be configured 
as timers. A special handshake mode allows TMS320C3xs to communicate 
over their serial ports with guaranteed synchronization. 
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2.6 Direct Memory Access (DMA) 



The on-chip Direct Memory Access (DMA) controller can read from or write to 
any location in the memory map without interfering with the operation of the 
CPU. Therefore, the TMS320C3x can interface to slow external memories and 
peripherals without reducing throughput to the CPU. The DMA controller con- 
tains its own address generators, source and destination registers, and trans- 
fer counter. Dedicated DMA address and data buses allow for minimization of 
conflicts between the CPU and the DMA controller. A DMA operation consists 
of a block or single-word transfer to or from memory. Refer to Chapter 8 for de- 
tailed information on the DMA. Figure 2-7shows the DMAcontroller with asso- 
ciated buses. 



Figure 2-7. DMA Controller 



L 



DMADATA Bus 



M 



DMAADDRBUS: 



~7 



DMA Controller 



Global Control Register 



Source Address Register 



Destination Address 
Register 



Transfer Counter Register 



w 



> 

-A u 
s 



2-26 



Architectural Overview 



System Integration 



2.7 System Integration 



In summary, the TMS320C3x is a powerful DSP system because of its integra- 
tion of an innovative, high-performance CPU, two external interface ports, 
large memories, and efficient buses to support its speed. A single chip con- 
tains this system along with peripherals such as a DMA controller, two serial 
ports, and two timers. The total system real estate and price have been re- 
duced, providing the user with a true single-chip solution. 
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Chapter 3 

CPU Registers, Memory, and Cache 



The CPU registerfile contains 28 registers that can be operated on by the multi- 
plier and ALU (arithmetic logic unit). Included in the registerfile are the auxiliary 
registers, extended-precision registers, and index registers. The registers in 
the CPU register file support addressing, floating-point/integer operations, 
stack management, processor status, block repeats, and interrupts. 

The TMS320C3x provides a total memory space of 1 6M (million) 32-bit words 
containing program, data, and I/O space. Two RAM blocks of 1 K x 32 bits each 
and a ROM block, available only on the TMS320C30, of 4K x 32 bits permit two 
CPU accesses in a single cycle. The memory maps for the microcomputer and 
microprocessor modes are similar, except that the on-chip ROM is not used 
in microprocessor mode. 

A 64 x 32-bit instruction cache stores often repeated sections of code. This 
greatly reduces the number of off -chip accesses and allows code to be stored 
off-chip in slower, lower-cost memories. Three bits in the CPU status register 
control the clear, enable, or freeze of the cache. 

This chapter describes in detail each of the CPUregisters, the memory maps, 
and the instruction cache. Major topics are as follows: 

Q CPU Register File (Section 3.1 on page 3-3) 

■ Extended-precision registers (R7 — RO) 

■ Auxiliary registers (AR7 — ARO) 

■ Index registers (IRO, IR1) 

■ Block-size register (BK) 

■ Data-page pointer (DP) 

■ System stack pointer (SP) 

■ Status register (ST) 

■ CPU/DMA interrupt enable register (IE) 

■ CPU interrupt flag register (IF) 

■ I/O flags register (IOF) 
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■ Repeat-counter (RC) and block-repeat registers (RS, RE) 

■ Program counter (PC) 

Q Memory (Section 3.2 on page 3-12) 

■ Memory maps 

■ Peripheral bus map 

■ Reset/interrupt/trap map 

Ql Instruction Cache (Section 3.3 on page 3-1 9) 

■ Cache architecture 

■ Cache algorithm 

■ Cache control bits 
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3.1 CPU Register File 



The TMS320C3x provides 28 registers in a multiport register file that is tightly 
coupled to the CPU. The PC is not included in the 28 registers. All of these reg- 
isters can be operated upon by the multiplier and ALU, and can be used as gen- 
eral-purpose 32-bit registers. However, the registers also have some special 
functions for which they are particularly appropriate. For example, the eight ex- 
tended-precision registers are especially suited for maintaining extended-pre- 
cision floating-point results. The eight auxiliary registers support a variety of 
indirect addressing modes and can be used as general-purpose 32-bit integer 
and logical registers. The remaining registers provide system functions such 
as addressing, stack management, processor status, interrupts, and block re- 
peat. Refer to Chapter 5 for detailed information and examples of the use of 
CPU registers in addressing. 

The registers names and assigned function are listed in Table 3-1. 



Table 3-1. CPU Registers 



Register 


Assigned Function Name 


RO 
R1 
R2 
R3 
R4 
R5 
R6 
R7 


Extended-precision register 
Extended-precision register 1 
Extended-precision register 2 
Extended-precision register 3 
Extended-precision register 4 
Extended-precision register 5 
Extended-precision register 6 
Extended-precision register 7 


ARO 
AR1 
AR2 
AR3 
AR4 
AR5 
AR6 
AR7 


Auxiliary register 
Auxiliary register 1 
Auxiliary register 2 
Auxiliary register 3 
Auxiliary register 4 
Auxiliary register 5 
Auxiliary register 6 
Auxiliary register 7 


DP 
IRO 
IR1 
BK 
SP 


Data-page pointer 
Index register 
Index register 1 
Block-size register 
System stack pointer 


ST 

IE 

IF 

IOF 


Status register 
CPU/DMA interrupt enable 
CPU interrupt flags 
I/O flags 


RS 
RE 
RC 


Repeat start address 
Repeat end address 
Repeat counter 


PC 


Program counter 
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3.1 .1 Extended-Precision Registers (R7 — RO) 
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The eight extended-precision registers (R7 — RO) are capable of storing and 
supporting operations on 32-bit integer and 40-bit floating-point numbers. 
These registers consist of two separate and distinct regions: 

Q bits 39 — 32: dedicated to storage of the exponent (e) of the floating-point 
number. 

Ql bits 31 — 0: store the mantissa of the floating-point number: 

■ bit 31: sign bit (s), 

■ bits 30 — 0: the fraction (f). 

Any instruction that assumes the operands are floating-point numbers uses 
bits 39 — 0. Figure 3-1 illustrates the storage of 40-bit floating-point numbers 
in the extended-precision registers. 



Figure 3-1. Extended-Precision Register Floating-Point Format 
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For integer operations, bits 31 — Oof the extended-precision registers contain 
the integer (signed or unsigned). Any instruction that assumes the operands 
are either signed or unsigned integers uses only bits 31 — 0. Bits 39 — 32 re- 
main unchanged. This is true for all shift operations. The storage of 32-bit inte- 
gers in the extended-precision registers is shown in Figure 3-2. 



Figure 3-2. Extended-Precision Register Integer Format 
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3.1 .2 Auxiliary Registers (AR7 — AR0) 

The eight 32-bit auxiliary registers (AR7 — AR0) can be accessed by the CPU 
and modified by the two Auxiliary Register Arithmetic Units (ARAUs). The pri- 
mary function of the auxiliary registers is the generation of 24-bit addresses. 
However, they can also be used as loop counters in indirect addressing or as 
32-bit general-purpose registers that can be modified by the multiplier and 
ALU. Refer to Chapter 5 for detailed information and examples of the use of 
auxiliary registers in addressing. 
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3.1 .3 DatarPage Pointer (DP) 



The data-page pointer (DP) is a 32-bit register, which is loaded using the LDP 
instruction. The eight LSBs of the data-page pointer are used by the direct ad- 
dressing mode as a pointer to the page of data being addressed. Data pages 
are 64 K words long with a total of 256 pages. Bits 31 — 8 are reserved; you 
should always keep these zero. 



3.1 .4 Index Registers (IRO, 1R1) 



The 32-bit index registers (IRO and IR1) are used by the Auxiliary Register 
Arithmetic Unit (ARAU) for indexing the address. Refer to Chapter 5 for de- 
tailed information and examples of the use of index registers in addressing. 



3.1 .5 Block-Size Register (BK) 



The 32-bit block-size register (BK) is used by the ARAU in circular addressing 
to specify the data block size (see Section 5.3 on page 5-24). 



3.1 .6 System Stack Pointer (SP) 



The system stack pointer (SP) is a 32-bit register that contains the address of 
the top of the system stack. The SP always points to the last element pushed 
onto the stack. The SP is manipulated by interrupts, traps, calls, returns, and 
the PUSH, PUSHF, POP, and POPF instructions. Pushes and pops of the stack 
perform preincrement and postdecrement, respectively, on all 32 bits of the 
stack pointer. However, only the 24 LSBs are used as an address. Refer to Sec- 
tion 5.5 on page 5-30 for information about system stack management. 



3.1 .7 Status Register (ST) 



The status register (ST) contains global information relating to the state of the 
CPU. Typically, operations set the condition flags of the status register accord- 
ing to whether the result is zero, negative, etc. This includes register load and 
store operations as well as arithmetic and logical functions. When the status 
register is loaded, however, the contents of the source operand replace the 
current contents bit-for-bit, regardless of the state of any bits in the source op- 
erand. Therefore, following a load, the contents of the status register are identi- 
cally equal to the contents of the source operand. This allows the status regis- 
ter to be saved easily and restored. At system reset, is written to this register. 
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The format of the status register is shown in Figure 3-3. Table 3-2 defines the 
status register bits, their names, and functions. 



Figure 3-3. Status Register 
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NOTE: xx = reserved bit. 

R = read, W = write. 
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Table 3-2. Status Register Bits Summary 



Bit 


Name 


Reset Value 


Function 


ot 


C 





Carry flag. 


It 


V 





Overflow flag. 


2t 


Z 





Zero flag. 


3t 


N 





Negative flag. 


4t 


UF 





Floating-point underflow flag. 


5t 


LV 





Latched overflow flag. 


6t 


LUF 





Latched floating-point underflow flag. 


7 


OVM 





Overflow mode flag. This flag affects only the integer operations. If OVM 
= 0, the overflow mode is turned off; integer results that overflow are 
treated in no special way. If OVM = 1 , 

a) integer results overflowing in the positive direction are set to the 
most positive 32-bit twos-complement number (7FFFFFFFh) 

b) integer results overflowing in the negative direction are set to the 
most negative 32-bit twos-complement number (80000000h). 

Note that the function of V and LV is independent of the setting of OVM. 


8 


RM 





Repeat mode flag. If RM = 1 , the PC is being modified in either the 
repeat-block or repeat-single mode. 


9 


Reserved 





Read as 0. 


10 


CF 





Cache freeze. When CF = 1 , the cache is frozen. If the cache is enabled 
(CE = 1 ), fetches from the cache are allowed, but no modification of the 
state of the cache is performed. This function can be used to save fre- 
quently used code resident in the cache. At reset, is written to this bit. 
Cache clearing (CC=1 ) is allowed when CF=0. 


11 


CE 





Cache enable. CE = 1 enables the cache, allowing the cache to be used 
according to the least recently used (LRU) cache algorithm. CE = dis- 
ables the cache; no update or modification of the cache can be per- 
formed. No fetches are made from the cache. This function is useful for 
system debug. At system reset, is written to this bit. Cache clearing 
(CC = 1) is allowed when CE=0. 


12 


CC 





Cache clear. CC = 1 invalidates all entries in the cache. This bit is always 
cleared after it is written to and thus always read as 0. At reset, is written 
to this bit. 


13 


GIE 





Global interrupt enable. If G IE = 1 , the CPU responds to an enabled inter- 
rupt. If GIE = 0, the CPU does not respond to an enabled interrupt. 


15 — 14 


Reserved 





Read as 0. 


31—16 


Reserved 


0-0 


Value undefined. 



t The seven condition flags (ST bits 6 — 0) are defined in Section 1 0.2 on page 1 0-9. 
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3.1 .8 CPU/DMA Interrupt Enable Register (IE) 



The CPU/DMA interrupt enable register (IE) is a 32-bit register (see 
Figure 3-4). The CPU interrupt enable bits are in locations 10 — 0. The DMA 
interrupt enable bits are in locations 26 — 1 6. A 1 in a CPU/DMA interrupt en- 
able register bit enables the corresponding interrupt. A disables the corre- 
sponding interrupt. At reset, is written to this register. Table 3-3 defines the 
register bits, the bit names, and the bit functions. 



Figure 3-4. CPU/DMA Interrupt Enable Register (IE) 
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xx = reserved bit, read as 0. 
R = read, W = write. 
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Table 3-3. IE Register Bits Summary 



Bit 


Name 


Reset Value 


Function 





EINTO 





Enable external interrupt (CPU) 


1 


EINT1 





Enable external interrupt 1 (CPU) 


2 


EINT2 





Enable external interrupt 2 (CPU) 


3 


EINT3 





Enable external interrupt 3 (CPU) 


4 


EXINTO 





Enable serial-port transmit interrupt (CPU) 


5 


ERINTO 





Enable serial-port receive interrupt (CPU) 


6 


EXINT1 





Enable serial-port 1 transmit interrupt (CPU) 


7 


ERINT1 





Enable serial-port 1 receive interrupt (CPU) 


8 


ETINTO 





Enable timer interrupt (CPU) 


9 


ETINT1 





Enable timer 1 interrupt (CPU) 


10 


EDINT 





Enable DMA controller interrupt (CPU) 


15 — 11 


Reserved 





Value undefined 


16 


EINTO 





Enable external interrupt (DMA) 


17 


EINT1 





Enable external interrupt 1 (DMA) 


18 


EINT2 





Enable external interrupt 2 (DMA) 


19 


EINT3 





Enable external interrupt 3 (DMA) 


20 


EXINTO 





Enable serial-port transmit interrupt (DMA) 


21 


ERINTO 





Enable serial-port receive interrupt (DMA) 


22 


EXINT1 





Enable serial-port 1 transmit interrupt (DMA) 


23 


ERINT1 





Enable serial-port 1 receive interrupt (DMA) 


24 


ETINTO 





Enable timer interrupt (DMA) 


25 


ETINT1 





Enable timer 1 interrupt (DMA) 


26 


EDINT 





Enable DMA controller interrupt (DMA) 


31—27 


Reserved 


0-0 


Value undefined 



3.1 .9 CPU Interrupt Flag Register (IF) 



The 32-bit CPU interrupt flag register (IF) is shown in Figure 3-5. A 1 in a CPU 
interrupt flag register bit indicates that the corresponding interrupt is set. The 
IF bits are set to 1 when an interrupt occurs. They may also be set to 1 through 
software to cause an interrupt. A indicates that the corresponding interrupt 
is not set. If a is written to an interrupt flag register bit, the corresponding inter- 
rupt is cleared. At reset, is written to this register. Table 3-4 lists the bit fields, 
bit field names, and bit field functions of the CPU interrupt flag register. 
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Figure 3-5. CPU Interrupt Flag Register (IF) 
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Table 3-4. IF Register Bits Summary 



Bit 


Name 


Reset Value 


Function 





INTO 





External interrupt flag 


1 


INT1 





External interrupt 1 flag 


2 


INT2 





External interrupt 2 flag 


3 


INT3 





External interrupt 3 flag 


4 


XINTO 





Serial-port transmit interrupt flag 


5 


RINTO 





Serial-port receive interrupt flag 


6 


XINT1T 





Serial-port 1 transmit interrupt flag 


7 


RINT1T 





Serial-port 1 receive interrupt flag 


8 


TINTO 





Timer interrupt flag 


9 


TINT1 





Timer 1 interrupt flag 


10 


DINT 





DMA channel interrupt flag 


31—11 


Reserved 


0-0 


Value undefined 



t Reserved on TMS320C31. 



3.1.10 I/O Flags Register (IOF) 



The I/O flags register (IOF) is shown in Figure 3-6 and controls the function 
of the dedicated external pins, XFO and XF1 . These pins may be configured 
for input or output. They may also be read from and written to. At reset, is writ- 
ten to this register. The bit fields, bit field names, and bit field functions are 
shown in Table 3-5. 



Figure 3-6. I/O Flag Register (IOF) 



31 30 29 28 27 26 25 24 



20 



XX | XX | XX | XX | XX | XX | XX | XX I 



M 
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17 
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| xx j xx | xx | xx | xx | xx | xx | xx | INXF1 | OUTXF1 | I/OXF1 | xx | INXFO | OUTXFO | i/OXFO | s 



NOTE: 



xx = reserved bit, read as 0. 
R = read, W = write. 



RAW 



RAW 



RAW 



RAW 
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Table 3-5. IOF Register Bits Summary 



Bit 


Name 


Reset Value 


Function 





Reserved 





Read as 0. 


1 


I/OXFO 





If I/OXFO = 0, XFO is configured as a general-purpose input pin. 
If I/OXFO = 1 , XFO is configured as a general-purpose output pin. 


2 


OUTXFO 





Data output on XFO. 


3 


INXFO 





Data input on XFO. A write has no effect. 


4 


Reserved 





Read as 0. 


5 


I/OXF1 





If I/OXF1 = 0, XF1 is configured as a general-purpose input pin. 
If I/OXF1 ss 1 , XF1 is configured as a general-purpose output pin. 


6 


OUTXF1 





Data output on XF1 . 


7 


INXF1 





Data input on XF1 . A write has no effect. 


31—8 


Reserved 


0-0 


Read as 0. 



3.1.11 Repeat-Count (RC) and Block-Repeat Registers (RS, RE) 

The repeat-count register (RC) is a 32-bit register used to specify the number 
of times a block of code is to be repeated when performing a block repeat. 

The repeat start address register (RS) is a 32-bit register containing the start- 
ing address of the block of program memory to be repeated when operating 
in the repeat mode. 

The 32-bit repeat end address register (RE) contains the ending address of the 
block of program memory to be repeated when operating in the repeat mode. 

3.1.12 Program Counter (PC) 

The program counter (PC) is a 32-bit register containing the address of the next 
instruction to be fetched. While the program counter is not part of the CPU reg- 
ister file, it is a register that can be modified by instructions that modify the pro- 
gram flow. 

3.1.13 Reserved Bits and Compatibility 

In order to retain compatibility with future members of the TMS320C3x family 
of microprocessors, reserved bits that are read as zero must be written as zero. 
Reserved bits that have an undefined value must not have their current value 
modified. In other cases, the user should maintain the reserved bits as speci- 
fied. 
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The TMS320C3x's total memory space of 1 6M (million) 32-bit words contains 
program, data, and I/O space, allowing tables, coefficients, program code, or 
data to be stored in either RAM or ROM. In this way, memory usage can be 
maximized and memory space allocated as desired. 

RAM blocks and 1 are each 1 K x 32 bits. The ROM block is 4K x 32 bits. Each 
on-chip RAM and ROM block is capable of supporting CPU two accesses in 
a single cycle. The separate program buses, data buses, and DMA buses allow 
for parallel program fetches, data reads/writes, and DMA operations. Chapter 
9 covers this in detail. 

3.2.1 TMS320C3x Memory Maps 

The memory map is dependent upon whether the processor is running in the 
microprocessor mode (MC/MP or MCBL/MP = 0) or the microcomputer mode 
(MC/MP or MCBL/MP = 1 ). The memory maps for these modes are similar (see 
Figure 3-7). Locations 800000h through 801 FFFh are mapped to the expan- 
sion bus . When this region, available only on the TMS320C30, is accessed, 
MSTRB is active. Locations 802000h through 803FFFh are reserved. Loca- 
tions 804000h through 805FFFh are mapped to the exp ansion bu s. When this 
region, available only on the TMS320C30, is accessed, IOSTRB is active. Lo- 
cations 806000h through 807FFFh are reserved. All of the memory-mapped 
peripheral registers are in locations 808000h through 8097FFh. In both modes, 
RAM block is located at addresses 809800h through 809BFFh, and RAM 
block 1 is located at addresses 809C00h through 809FFFh. Memory locations 
80A0 00h th rough OFFFFFFh are accessed over the primary external memory 
port (STRB active). 

In microprocessor mode, the 4K on-chip ROM (TMS320C30) or bootloader 
(TMS320C31 ) is not mapped into the TMS320C3x memory map. As shown in 
Figure 3-7, locations Oh through BFh consist of interrupt vector, trap vector, 
and reserved locatio ns, all of which are accessed over the primary external 
memory port (STRB active). Interrupt and trap vector locations are shown in 
Figure 3-9. Locations COh through 7FFFFFh are also accessed over the pri- 
mary external memory port. 

In microcomputer mode, the 4K on-chip ROM (TMS320C30) or bootloader 
(TMS320C31 ) is mapped into locations Oh through OFFFh. There are i 92 loca- 
tions (Oh through BFh) within this block for interrupt vectors, trap vectors, and 
a reserved space. Locations 1 0OOh th rough 7FFFFFh are accessed over the 
primary external memory port (STRB active). 

Do not read and write reserved portions of the TMS320C3x memory space and 
reserved peripheral bus addresses. Doing so may cause the TMS320C3x to 
halt operation and require a system reset to restart. 

3-12 CPU Registers, Memory, and Cache 



Figure 3-7. TMS320C30 Memory Maps 
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3.2.2 TMS320C31 Memory Maps 



Setting the state of the TMS320C31 MCBL/MP pin determines the mode in 
which the TMS320C31 can function: 

Qi Microprocessor mode (MCBL/MP = 0) 

□ Microcomputer/boot loader mode (MCBL/MP = 1 ) 

The major difference between these two modes is their memory maps (see 
Figure 3-8). The program boot load feature is enabled when the MCBL/MP pin 
is driven high during reset. 

Notice that special memory locations are used by the loader (internal and ex- 
ternal). They are identified in the microcomputer/boot loader memory map 
shown in Figure 3-8. 
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Figure 3-8. TMS320C31 Memory Maps 
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(b) Microcomputer/Boot Loader Mode 
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Boot 1-3 locations are used by the boot loader function. See Section 3.4 for 
a complete description. All reserved memory locations are described in 
Table 12-5 of Section 12.8. 



3.2.3 Reset/Interrupt/Trap Vector Map 



The addresses for the reset, interrupt, and trap vectors are Oh through 3Fh, as 
shown in Figure 3-9. The vectors stored in these locations are the addresses 
of the start of the respective reset, interrupt, and trap routines. For example, 
at reset, the contents of memory location Oh (the reset vector) are loaded into 
the PC and execution begins from that address. 

Traps 28 — 31 are reserved; do not use them. 
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Figure 3-9. Reset, Interrupt and Trap Vector Locations 
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t Reserved on TMS320C31 
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3.2.4 Peripheral Bus Map 



The memory-mapped peripheral registers are located starting at address 
808000h. The peripheral bus memory map is shown in Figure 3-1 0. Each pe- 
ripheral occupies a 16-word region of the memory map. Locations 80801 Oh 
through 80801 Fh and locations 808070h through 8097FFh are reserved. 



Figure 3-10. Peripheral-Bus Memory Map 
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t Reserved on TMS320C31 
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3.3 Instruction Cache 



A 64 x 32-bit instruction cache facilitates maximum system performance with 
minimal system cost by storing sections of code that can be fetched when re- 
peatedly accessing time-critical code. This reduces the number of off-chip ac- 
cesses necessary and allows for code to be stored off-chip in slower, lower- 
cost memories. The cache also frees external buses from program fetches so 
they can be used by the DMA or other system elements. 

The cache can operate in a completely automatic fashion without the need for 
user intervention. Section 3.3.2 describes a form of the LRU (least recently 
used) cache update algorithm. 



3.3.1 Cache Architecture 



The instruction cache (see Figure 3-11) contains 64 32-bit words of RAM and 
is divided into two 32-word segments. Associated with each segment is a 1 9-bit 
segment start address (SSA) register. For each word in the cache, there is a 
corresponding single bit: Present (P) flag. 



Figure 3-11. Instruction Cache Architecture 
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When the CPU requests an instruction word from external memory, a check 
is made to determine if the word is already contained in the instruction cache. 
Figure 3-11 shows the partitioning of an instruction address as used by the 
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cache control algorithm. The1 9 most significant bits of the instruction address 
are used to select the segment, and the 5 least significant bits define the ad- 
dress of the instruction word within the pertinent segment. The 1 9 MSBs of the 
instruction address are compared with the two segment start address (SSA) 
registers. If a match is found, a check is made of the relevant P flag. The P 
flag indicates whether or not the word within a particular segment is already 
present in cache memory. 



Figure 3- 12. Address Partitioning for Cache Control Algorithm 

23 5 4 



segment start address 
(SSA) 


instruction word 
address within segment 



If there is no match, one of the segments must be replaced by the new data. 
The segment replaced in this circumstance is determined by the LRU (least-re- 
cently-used) algorithm. The LRU stack (see Figure 3-1 1 ) is maintained for this 
purpose. 

The LRU stack determines which of the two segments qualifies as the least-re- 
cently-used after each access to the cache; therefore, the stack contains either 
0,1 or 1 ,0. Each time a segment is accessed, its segment number is removed 
from the LRU stack and pushed onto the top of the LRU stack. Therefore, the 
number at the top of the stack is the most recently used segment number, and 
the number at the bottom of the stack is the least recently used segment num- 
ber. 

At system reset, the LRU stack is initialized with at the top and 1 at the bottom. 
All P flags in the instruction cache are cleared. 

When a replacement is necessary, the least recently used segment is selected 
for replacement. Also, the 32 P flags for the segment to be replaced are set 
to 0, and the segment's SSA register is replaced with the 1 9 MSBs of the in- 
struction address. 

3.3.2 Cache Algorithm 

When the TMS320C3x requests an instruction word from external memory, 
one of two possible actions occurs: a cache hit or a cache miss. These are de- 
scribed in the following list: 

Q Cache Hit. The cache contains the requested instruction, and the follow- 
ing actions occur: 

■ The instruction word is read from the cache. 

■ The number of the segment containing the word is removed from the 
LRU stack and pushed to the top of the LRU stack, thus moving the 
other segment number to the bottom of the stack. 
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Qi Cache Miss. The cache does not contain the instruction. Types of cache 
miss are 

El Word miss. The segment address register matches the instruction ad- 
dress, but the relevant P flag is not set. The following actions occur in 
parallel: 

a The instruction word is read from memory and copied into the 
cache. 

a The number of the segment containing the word is removed from 
the LRU stack and pushed to the top of the LRU stack, thus moving 
the other segment number to the bottom of the stack. 

a The relevant P flag is set. 

Q Segment miss. Neither of the segment addresses matches the in- 
struction address. The following actions occur in parallel: 

n The least recently used segment is selected for replacement. The 
P flags for all 32 words are cleared. 

a The SSA register for the selected segment is loaded with the 1 9 
MSBs of the address of the requested instruction word. 

° The instruction word is fetched and copied into the cache. It goes 
into the appropriate word of the least recently used segment. The 
P flag for that word is set to 1 . 

n The number of the segment containing the instruction word is re- 
moved from the LRU stack and pushed to the top of the LRU stack, 
thus moving the other segment number to the bottom of the stack. 

Only instructions may be fetched from the program cache. All reads and writes 
of data in memory bypass the cache. Program fetches from internal memory 
do not modify the cache and do not generate cache hits or misses. The pro- 
gram cache is a single-access memory block. Dummy program fetches (i.e., 
following a branch) are treated by the cache as valid program fetches and can 
generate cache misses and cache updates. 

Take care when using self-modifying code. If an instruction resides in cache 
and the corresponding location in primary memory is modified, the copy of the 
instruction in cache is not modified. 

You can make more efficient use of the cache by aligning program code on 
32-word address boundaries. Do this by using the ALIGN directive when cod- 
ing assembly language. 
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3.3.3 Cache Control Bits 



Three cache control bits are located in the CPU status register: the cache clear 
bit (CC), the cache enable bit (CE), and the cache freeze bit (CF). 

Cache Clear Bit (CC). Writing a 1 to the cache clear bit (CC) invalidates all 
entries in the cache. All P flags in the cache are cleared. The CC bit is 
always cleared after the cache is cleared. It is therefore always read as 
a 0. At reset, the cache is cleared and is written to this bit. 

Cache Enable Bit (CE). Writing a 1 to this bit enables the cache. When en- 
abled, the cache is used according to the previously described cache 
algorithm. Writing a to the cache enable bit disables the cache; no 
updates or modification of the cache can be performed. Specifically, 
no SSA register updates are performed, no P flags are modified (un- 
less CC = 1 ) , and the LRU stack is not modified. Writing a 1 to CC when 
the cache is disabled clears the cache, and, thus, the P flags. No 
fetches are made from the cache when the cache is disabled. At reset, 
is written to this bit. 

Cache Freeze Bit (CF). When CF = 1 , the cache is frozen. If, in addition, the 
cache is enabled, fetches from the cache are allowed, but no modifica- 
tion of the state of the cache is performed. Specifically, no SSA register 
updates are performed, no P flags are modified (unless CC = 1 ), and 
the LRU stack is not modified. This function can be used to keep fre- 
quently used code resident in the cache. Writing a 1 to CC when the 
cache is frozen clears the cache, and, thus, the P flags. At reset, is 
written to this bit. 

Table 3-6 defines the effect of the CE and CF bits used in combination. 



Table 3-6. Combined Effect of the CE and CF Bits 



CE 


CF 


Effect 








Cache not enabled 





1 


Cache not enabled 


1 





Cache enabled and not frozen 


1 


1 


Cache enabled and frozen 
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3.4 Using the TMS320C31 Boot Loader 



This section describes how to use the TMS320C31 microcomputer/boot loader 
(MCBL/MP)function. This feature is unique to the TMS320C31 and is not avail- 
able on the TMS320C30 device. 



3.4.1 Boot Loader Operations 



The boot loader lets you load and execute programs that are received from a 
host processor, inexpensive EPROMs, or other standard memory devices. 
The programs to be loaded reside in one of three memory mapped areas identi- 
fied as Boot 1 , Boot 2, and Boot 3 (see the shaded areas of Figure 3-8 ), or 
the programs are received by means of the serial port. 

User-definable byte, half-word, and word data formats are supported. 32-bit 
fixed burst loads from the TMS320C31 serial port are also supported. See Sec- 
tion 8.2 for a detailed description of the serial port operation. 



3.4.2 Invoking the Boot Loader 



The boot loader function is selected by resetting the processor while driving 
the MCBL/MP pin high. Figure 3-1 3 shows the flow of this operation, which is 
dependent upon the mode selected (external memory or serial boot). 
Figure 3-1 4 shows memory load operations; Figure 3-1 5 shows serial port 
load operations. 



Figure 3-13. Boot Loader Mode Selection Flowchart 
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Figure 3-14. Boot Loader Memory Load Flowchart 
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Figure 3-15. Boot Loader Serial Port Load Mode Flowchart 
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3.4.3 Mode Selection 



After reset, the loader mode is determined by polling the status of the INT3- 
INTO pins. Table 3-7 lists the options that you ca n select. The options are 
based upon the active low state of the INT3-INT0 signals. The TMS320C31 
device begins reading data from the boot memory location sel ected by the ac- 
tive interrupt signal. Interrupts can be driven any time after the RESET pin has 
been deasserted. 
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Table 3-7. Loader Mode Selection 



Active Interrupt 


Loader Mode 


Memory Addresses 


Into 


External memory 


Boot 1 address 0x001 000 


IntT 


External memory 


Boot 2 address 0x400000 


TRT2 


External memory 


Boot 3 address OxFFFOOO 


INT3 


32-bit serial 


Serial port 



3.4.4 External Memory Loading 



Table 3-8 shows and describes the information that you must specify to define 
boot memory organization (8, 1 6, or 32 bits), the code block size, the load desti- 
nation address, and memory access timing control for the boot memory. This 
must be done before a source program can be externally loaded. 

This information must be specified in the first four locations of the Boot 1 , Boot 
2, or Boot 3 areas. The header is followed by the data or program code that 
is the BLK size in length. 



Table 3-8. External Memory Loader Header 



Location 


Description 


Valid Data Entries 





Boot memory type (8, 16, or 32) 


0x8, 0x10, or 0x20 specified as a 32-bit number. 


1 


Boot memory configuration 
(defined # of wait states, etc.) 


See Chapter 7 of the TMS320C3x User's Guide 
for valid bus control register entries. 


2 


Program block size (BLK) 


Any value < BLK < 224. 


3 


Destination address 


Any valid TMS320C31 24-bit address. 


4 


Program code starts here 


Any 32-bit data value or valid TMS320C3x in- 
struction. 



The loader fetches 32 bits of data for each specified location, regardless of 
what memory configuration width is specified. The data values must reside 
within or be written to memory, beginning with the value of least significance 
for each 32 bits of information. 



3.4.5 Examples of External Memory Loads 



Example 3-1, Example 3-2, and Example 3-3 show memory images for 
byte-wide, 1 6-bit wide, and 32-bit wide configured memory. 

These examples assume that 



1) an INTO signal was detected after reset is deasserted (external memory 
load from Boot 1). 

2) the loader header resides at memory location 0x1 000 and defines the fol- 
lowing: 
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a) boot memory type EPROMs that require two wait states and SWW = 
11. 

b) a loader destination address at the beginning of the TMS320C31 's in- 
ternal RAM Block 0, and 

c) a single block of memory that is 0x1 FF in length. 
Example 3-1. Byte-Wide Configured Memory 



Address 


Value 


Comments 


0x1000 


0x08 


Memory width = 8 bits 


0x1001 


0x00 


0x1 002 


0x00 


0x1003 


0x00 


0x1004 


0x58 


Memory type = SWW = 11 , WCNT = 2 


0x1005 


0x10 


0x1006 


0x00 


0x1007 


0x00 


0x1008 


OxFF 


Program code size = 0x1 FF 


0x1009 


0x01 


0x1 00A 


0x00 


0x1 00B 


0x00 


0x1 00C 


0x00 


Program load starting address = 0x8098p0 


0x1 00D 


0x98 


0x1 00E 


0x80 


0x1 OOF 


0x00 



Example 3-2. 16-Bit Wide Configured Memory 



Address 


Value 


Comments 


0x1000 


0x10 


Memory width = 16 


0x1001 


0x0000 


0x1002 


0x1058 


Memory type = SWW = 11 , WCNT = 2 


0x1003 


0x0000 


0x1004 


0x1 FF 


Program code size = 0x1 FF 


0x1005 


0x0000 


0x1006 


0x9800 


Program load starting address = 0x809800 


0x1007 


0x0080 
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Example 3-3. 32-Bit Wide Configured Memory 



Address 


Value 


Comments 


0x1000 


0x00000020 


Memory width = 32 


0x1001 


0x00001058 


Memory type = SWW = 11 , WCNT = 2 


0x1002 


0x000001 FF 


Program code size = 0x1 FF 


0x1003 


0x00809800 


Program load starting address = 0x809800 



After the header is read, the loader transfers BLK, 32-bit words beginning at 
a specified destination address. Code blocks require the same byte and half- 
word ordering conventions. Additionally, the loader can be used to load multi- 
ple code blocks at different address destinations. 

If multiple code blocks are loaded, execution begins at the first block of code 
loaded. Consequently, the first code block loaded should be a startup routine 
to access the other loaded programs. 

If another code block is to be loaded, the following header and its correspond- 
ing code must be appended to the preceding code block: 



BLK size 
Destination address 



1st location 
2nd location 



Repeat this procedure for additional code blocks. End the loader function and 
begin execution of the first code block by appending the value of 0x00000000 
to the last block. 



It is assumed that at least one block of code will be loaded when the 
loader is invoked. Initial loader invocation with a block size of 
0x00000000 produces unpredictable results. 



3.4.6 Serial Port Loading 



Boot loads, by way of the TMS320C31 serial port, are selected by driving the 
INT3 pin active lowfollowing reset. The loader automatically configures the se- 
rial port for 32-bit fixed burst reads. It is interrupt-driven by the FSR signal. You 
cannot change this mode for boot loads. The serial port clock and FSR are ex- 
ternally generated by your hardware. 

As in parallel loading, a header must precede the actual program to be loaded. 
However, only the block size and destination address must be provided be- 
cause serial port speed and data format are predefined by the loader and your 
hardware (i.e., skip data words and 1 from Table 3-8). 
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Using the TMS320C31 Boot Loader 



The transferred data-bit order must begin with the most significant bit (MSB) 
and end with the least significant bit (LSB). 



3.4.7 interrupt and I rap Vector Mapping 



Unlike the microprocessor mode, the microcomputer/boot loader (MCBL) 
mode uses a dual-vectoring scheme to service interrupt and trap requests. 
Dual vectoring was implemented to ensure code compatibility with future ver- 
sions of TMS320C3x devices. 

In a dual-vectoring scheme, branch instructions to an address, rather than di- 
rect interrupt vectoring, are used. The normal interrupt and trap vectors are de- 
fined to vector to the last 63 locations in the on-chip RAM. When the loader is 
invoked, the TMS320C31 's last 63 locations of RAM Block 1 are assumed to 
contain interrupt and trap branch instructions. 



Take care to ensure that these locations are not inadvertently over- 
written by loaded program or data values. 



Table 3-9 shows the MCBL/MP mode interrupt and trap instruction memory 
maps. 
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Table 3-9. TMS320C31 Interrupt and Trap Memory Maps 



Address 


Description 


809FC1 


Into 


809FC2 


IntT 


809 FC3 


INT2 


809FC4 


INT3 


809FC5 


XINTO 


809FC6 


RINTO 


809FC7 


Reserved 


809FC8 


Reserved 


809FC9 


TINTO 


809FCA 


TINT1 


809FCB 


DINTO 


809FCC-809FDF 


Reserved 


809FE0 


TRAPO 


809FE1 


TRAP1 


• 
• 
• 


• 
• 
• 


809FFB 


TRAP27 


809FFC-809FFF 


Reserved 
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Chapter 4 

Data Formats and Floating-Point Operation 



In the TMS320C3x architecture, data is organized into three fundamental 
types: integer, unsigned-integer, and floating-point. Note that the terms, inte- 
ger and signed-integer, are considered to be equivalent. The TMS320C3x sup- 
ports short and single-precision formats for signed and unsigned integers. It 
also supports short, single-precision and extended-precision formats for float- 
ing-point data. 

Floating-point operations make fast, trouble-free, accurate, and precise com- 
putations. Specifically, the TMS320C3x implementation of floating-point arith- 
metic facilitates floating-point operations at integer speeds while preventing 
problems with overflow, operand alignment, and other burdensome tasks com- 
mon in integer operations. 

This chapter discusses in detail the data formats and floating-point operations 
supported on the TMS320C3x. Major topics in this section are as follows: 

Qi Integer Formats (Section 4.1 on page 4-2) 

Qi Unsigned-Integer Formats (Section 4.2 on page 4-3) 

Qi Floating-Point Formats (Section 4.3 on page 4-4) 

□ Floating-Point Multiplication (Section 4.4 on page 4-1 0) 

Q Floating-Point Addition and Subtraction (Section 4.5 on page 4-1 4 ) 
Q Normalization (Section 4.6 on page 4-1 8) 

□ Rounding (Section 4.7 on page 4-20 ) 

d Floating-Point to Integer Conversions (Section 4.8 on page 4-22) 
Qi Integer to Floating-Point Conversions (Section 4.9 on page 4-24) 
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4.1 Integer Formats 



The TMS320C3x supports two integer formats: a 16-bit short integer format 
and a 32-bit single-precision integer format. When extended-precision regis- 
ters are used as integer operands, only bits 31 — are used; bits 39 — 32 re- 
main unchanged and unused. 



4.1.1 Short Integer Format 



The short integer format is a 1 6-bit twos-complement integer format used for 
immediate integer operands. For those instructions that assume integer oper- 
ands, this format is sign-extended to 32 bits (see Figure 4-1 ). The range of an 
integer si, represented in the short integer format, is -2 15 < si < 2 15 -1 . In 
Figure 4-1 , s = signed bit. 



Figure 4-1. Short Integer Format and Sign Extension of Short Integer 



15 



(a) Short Integer Format 



31 



16 15 



ssssssssssssssss s 



(b) Sign Extension of a Short Integer 



4.1.2 Single-Precision Integer Format 



In the single-precision integer format, the integer is represented in twos-com- 
plement notation. The range of an integer sp, represented in the single-preci- 
sion integer format, is- 2 31 < sp< 2 31 -1 . Figure 4-2 shows the single-preci- 
sion integer format. 



Figure 4-2. Single-Precision Integer Format 



31 
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4.2 Unsigned-Integer Formats 



Two unsigned-integer formats are supported on the TMS320C3x: a 16-bit 
short format and a 32-bit single-precision format. In extended-precision regis- 
ters, the unsigned-integer operands use only bits 31 — 0; bits 39 — 32 remain 
unchanged. 



4.2.1 Short Unsigned-Integer Format 



Figure 4-3 shows thel 6-bit, short, unsigned-integer format used for immedi- 
ate unsigned-integer operands. For those instructions that assume 
unsigned-integer operands, this format is zero-filled to 32 bits. In Figure 4-3 
below, x= MSB (1 orO). 



Figure 4-3. Short Unsigned-Integer Format and Zero Fill 



15 



31 



(a) Short Unsigned-Integer Format 
16 15 



000000 000000000 x 



(b) Zero Fill of a Short Unsigned Integer 



4.2.2 Single-Precision Unsigned-Integer Format 



In the single-precision unsigned-integer format, the number is represented as 
a 32-bit value, as shown in Figure 4-4. 



Figure 4-4. Single-Precision Unsigned-Integer Format 



31 
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4.3 Floating-Point Formats 



All TMS320C3x floating-point formats consist of three fields: an exponent field 
(e), a single-bit sign field (s), and afraction field (/)• These are stored as shown 
in Figure 4-5. The exponent field is a twos-complement number. The sign field 
and fraction field may be considered as one unit and referred to as the mantissa 
field (man). The twos-complement fraction is combined with the sign bit and 
the implied most significant bit to create the mantissa. The mantissa is used 
to represent a normalized twos-complement number. In a normalized repre- 
sentation, a most significant nonsign bit is implied, thus providing an additional 
bit of precision. The value of afloating-point number xas a function of the fields 
e, s, and f\s given as 

x = 01 ix 2 e if s = 0, or where the leading zero is the sign bit and the 
one is the implied most significant nonsign bit. 
1 0.fx 2 e if s = 1 , or where the leading one is the sign bit and the 

zero is the implied most significant nonsign bit. 
if e = most negative twos complement 

value of the specified exponent field width. 



Figure 4-5. Generic Floating-Point Format 



e 


s 


f 



man (mantissa) 



Note: e = exponent field 

s = single-bit sign field 
f = fraction field 



Three floating-point formats are supported on the TMS320C3x. The first is a 
short floating-point format for immediate floating-point operands, consisting of 
a 4-bit exponent, 1 sign bit, and an 1 1 -bit fraction. The second is a single-preci- 
sion format consisting of an 8-bit exponent, 1 sign bit, and a 23-bit fraction. The 
third is an extended-precision format consisting of an 8-bit exponent, 1 sign bit, 
and a 31 -bit fraction. 



4.3.1 Short Floating-Point Format 



In the short floating-point format, floating-point numbers are represented by a 
twos-complement 4-bit exponent field (e) and a twos-complement 12-bit man- 
tissa field (man) with an implied most significant nonsign bit. 
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Figure 4-6. Short Floating-Point Format 



Floating-Point Formats 



15 12 11 10 



man 



*l 



Operations are performed with an implied binary point between bits 11 and 1 0. 
When the implied most significant nonsign bit is made explicit, it is located to 
the immediate left of the binary point. The floating-point twos-complement 
number x in the short floating-point format is given by 

x=0 J Ux2e if s = 

10./x2® ifs=1 

ife = -8 

You must use the following reserved values to represent zero in the short float- 
ing-point format: 

e = -8 

s = 

The following examples illustrate the range and precision of the short float- 
ing-point format: 



Most Positive: 
Least Positive: 
Least Negative: 
Most Negative: 



x= (2-2 ~ 11 ) x 2? = 2.5594 x 10 2 
x=1 x2 "7 = 7.8125x10 ~ 3 
x=(-1-2- 11 )x2- 7 = -7.8163x10" 3 
x = -2x2 7 = -2.5600x10 2 
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4.3.2 Single-Precision Floating-Point Format 



In the single-precision format, the floating-point number is represented by an 
8-bitexponent field (e) and atwos-complement24-bitmantissafield {man) with 
an implied most significant nonsign bit. 

Operations are performed with an implied binary point between bits 23 and 22. 
When the implied most significant nonsign bit is made explicit, it is located to 
the immediate left of the binary point. The floating-point number x is given by 

x=0 J Ux2 e ifs = 
10./X2® ifs=1 
if e = -1 28 



Figure 4-7. Single-Precision Floating-Point Format 



31 24 



23 22 



You must use the following reserved values to represent zero in the single-pre- 
cision floating-point format: 

e = -128 

s = 

f=0 

The following examples illustrate the range and precision of the single-preci- 
sion floating-point format. 

Most Positive: x= (2-2 -23) x 2 127 = 3.4028234 x10 38 

Least Positive: x = 1 x2 " 1 27 = 5.877471 7 x 1 - 39 

Least Negative: x= (-1-2 -23) X 2 -127 = _ 5.8774724x10-39 

Most Negative: x=-2x2 12 7 = - 3.4028236 x10 38 



4.3.3 Extended-Precision Floating-Point Format 



In the extended-precision format, the floating-point number is represented by 
an 8-bit exponent field (e) and a 32-bit mantissa field (man) with an implied 
most significant nonsign bit. 
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Floating-Point Formats 



Operations are performed with an implied binary point between bits 31 and 30. 
When the implied most significant nonsign bit is made explicit, it is located to 
the immediate left of the binary point. The floating-point number x is given by: 



x=01.fx2e 


if s = 


10.7x26 


ifs=1 





if e = -1 28 



Figure 4-8. Extended-Precision Floating-Point Format 



39 



32 31 30 



man 



You must use the following reserved values to represent zero in the exten- 
ded-precision floating-point format: 

e = -128 

s = 

f=0 

The following examples illustrate the range and precision of the extended-pre- 
cision floating-point format: 



Most Positive: 
Least Positive: 
Least Negative: 
Most Negative: 



x= (2-2-31)x 2 127 = 3.4028236683 x10 38 
x=1 x 2 - 127 = 5.8774717541 x10~ 39 
x= (-1-2 -31) X 2 -127 = _ 5.8774717569x10 "39 
x=- 2 x 2127 = _ 3.4028236691 x 10 38 
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4.3.4 Conversion Between Floating-Point Formats 



Floating-point operations assume several different formats for inputs and out- 
puts. These formats often require conversion from one floating-point format to 
another (e.g., short floating-point format to extended-precision floating-point 
format). Format conversions occur automatically in hardware, with no over- 
head, as a part of the floating-point operations. Examples of the four conver- 
sions are shown below. When a floating-point format zero is converted to a 
greater-precision format, it is always converted to a valid representation of zero 
in that format. In the below figures, s = sign bit of the exponent. 

Qi Short floating-point format conversion to single-precision floating- 
point format. 



15 



12 11 10 



s x x x y y y 



31 



(a) Short Floating-Point Format 
27 24 23 22 



12 11 



ssssxxxxyy y 



(b) Single-Precision Floating-Point Format 



In this format, the exponent field is sign-extended and the fraction field filled 
with zeros. 

□i Short floating-point format conversion to extended-precision float- 
ing-point format. 



15 



12 11 10 



s x x x y y y 



(a) Short Floating-Point Format 
39 35 32 31 30 20 19 



ssssxxxxyy y 



(b) Extended-Precision Floating-Point Format 



The exponent field in this format is sign-extended and the fraction field filled 
with zeros. 
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Floating-Point Formats 



Qi Single-precision floating-point format conversion to extended-pre- 
cision floating-point format. 



31 



24 23 22 



x x y y y 



39 



(a)Single-Precision Floating-Point Format 
32 31 30 



8 7 



x x y y y 



(b) Extended-Precision Floating-Point Format 

The fraction field is filled with zeros. 

Qi Extended-precision floating-point format conversion to single-pre- 
cision floating-point format. 



39 



32 31 30 



8 7 



x x y y y z z 



31 



(a) Extended-Precision Floating-Point Format 
24 23 22 



X X 


y 


y y 



(b) Single-Precision Floating-Point Format 



The fraction field is truncated. 



4-9 



Floating-Point Multiplication 

4.4 Floating-Point Multiplication 

A floating-point number a can be written in floating-point format as in the follow- 
ing formula, where <x(man) is the mantissa and a(exp) is the exponent. 

a = a{man) x 2 a ( ex P) 

The product of a and b is c, defined as 

c = a x b = <x(man) x b(man) x 2(«( e *P) +b ( ex P)) 

c(man) = a{man) x b{man) 
c(exp) = a{exp) + b(exp) 

During floating-point multiplication, source operands are always assumed to 
be in the single-precision floating-point format. If the source of the operands 
is in short floating-point format, it is extended to the single-precision float- 
ing-point format. If the source of the operands is in extended-precision float- 
ing-point format, it is truncated to single-precision format. These conversions 
occur automatically in hardware with no overhead. All results of floating-point 
multiplications are in the extended-precision format. These multiplications oc- 
cur in a single cycle. 

A flowchart for floating-point multiplication is shown in Figure 4-9. In step 1, 
the 24-bit source operand mantissas are multiplied, producing a 50-bit result 
c(mari). (Note that input and output data are always represented as normalized 
numbers.) In step 2, the exponents are added, yielding c(exp). Steps 3 through 
6 checkf or special cases. Step 3 checks for whether c(man) in extended-preci- 
sion format is equal to zero. If c{man) is zero, step 7 sets c{exp) to -128, thus 
yielding the representation for zero. 

Steps 4 and 5 normalize the result. If a right shift of one is necessary, then in 
step 8,c(man) is right-shifted one bit, and one is added to c(exp). If a right shift 
of two is necessary, then in step 9, c{man) is right-shifted two bits, and two is 
added to c(exp). Step 6 occurs when the result is normalized. 

In step 1 0, c(man) is set in the extended-precision floating-point format. Steps 
1 1 through 1 6 check for special cases of c(exp). In step 1 4, if c{exp) has over- 
flowed (step 1 1 ) in the positive direction, then c{exp) is set to the most-positive 
extended-precision format value. If c(exp) has overflowed in the negative di- 
rection, then c{exp) is set to the most-negative extended-precision format val- 
ue. If c(exp) has underf lowed (step 12), then c is set to zero (step 15); i.e., 
c(man) = and c{exp) = -1 28. 



4- 1 Data Formats and Floating-Point Operation 



Figure 4-9. Flowchart for Floating-Point Multiplication 



a(man) b(man) 

_i L 



ill 



Multiply mantissas 



c(man) = a(man) x b{man) 
(50-bit result) 



<x(exp) 

1 



Floating-Point Multiplication 



b(exp) 
i (2) 



Add exponents 



c(exp) = cx(exp) + b(exp) 



Test for special cases of c(man) 



(3) 
c{man) = 



(4) 
Right- shift 1 
to normalize 



(5) 
Right- shift2 
to normalize 



(6) 

No shift 

to normalize 



JL 



(7) 



c(exp) = 
-128 



(8) 



c(man) > > 1 

and c(exp) = 

c(exp) + 1 



(9) 



c(man) > > 2 

and c(exp) = 

c(exp) + 2 



1 



£ 



Dispose of extra bits 



Put c(man) in extended 
precision floating-point 
format 



(10) 



Test for special cases of c(exp) 



(11) (12) (13) 

c(exp) overflow c(exp) underflow c(exp) in range 



M 



If c(man) >0, 
set c to most 
positive value. 
If c(man) < 0, 
set c to most 
negative value. 



c(exp)=-128 
c(man) = 



(15) 



Set c to final result 



T 



(16) 



c = axb 
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The following examples illustrate howfloating-point multiplication is performed 
on the TMS320C3x. For these examples, the implied most significant nonsign 
bit is made explicit. 



Example 4-1. Floating-Point Multiply (Both Mantissas = -2.0) 

Let 

a = -2.0x2«(^P) = 10.00000000000000000000000 x 2«( e *P) 
b = -2.0x2b(e*p) = 10.00000000000000000000000 x2 b (e*p) 

where a and b are both represented in binary form according to the normalized 
single-precision floating-point format. Then 

10.00000000000000000000000 x 2«( e *P) 
x 10.00000000000000000000000 x 2b(exp) 

0100.0000000000000000000000000000000000000000000000 x 2(«( e *P) +b(exp)) 

To place this number in the proper normalized format, it is necessary to shift 
the mantissa two places to the right and add two to the exponent. This yields 

10.00000000000000000000000 x 2< ex P) 
x 10.00000000000000000000000 x 2 b ( e *P) 

01 .0000000000000000000000000000000000000000000000 x 2 ( a(exp) +b(exp) +2) 

In floating-point multiplication, the exponent of the result may overflow. This 
can occur when the exponents are initially added or when the exponent is mo- 
dified during normalization. 

Example 4-2. Floating-Point Multiply (Both Mantissas = 1.5) 

Let 

a = 1.5 x 2«(e*p) = 01.10000000000000000000000 x 2«(e*p) 
b = 1 .5 x 2 b ( e *P) = 01 .1 0000000000000000000000 x 2 b ( e *P) 

where a and b are both represented in binary form according to the single-pre- 
cision floating-point format. Then 

01.10000000000000000000000 x 2«(e*P) 
x 01.10000000000000000000000 x 2 b (exp) 

0010.0100000000000000000000000000000000000000000000 x 2 («( e *P) +b(exp)) 
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To place this number in the proper normalized format, it is necessary to shift 
the mantissa one place to the right and add one to the exponent. This yields 

01.10000000000000000000000 x 2«(**P) 
x 01.10000000000000000000000 x 2 b ( e *P) 

01.00100000000000000000000000000000000000000000000 x 2 («(exp)+b(exp)+1) 

Example 4-3. Floating-Point Multiply (Both Mantissas = 1.0) 

Let 

a = 1 .0 x 2<*(e*P) = 01 .00000000000000000000000 x 2«(e*P) 
b = 1 .0 x 2b(e*p) = 01 .00000000000000000000000 x 2 b (e*P) 

where a and b are both represented in binary form according to the single-pre- 
cision floating-point format. Then 

01.00000000000000000000000 x 2«(e*p) 
x 01.00000000000000000000000 x 2b(e*P) 

0001.0000000000000000000000000000000000000000000000 x 2(«(exp)+b(exp)) 

This number is in the proper normalized format. Therefore, no shift of the man- 
tissa or modification of the exponent is necessary. 

These examples have shown cases where the product of two normalized num- 
bers can be normalized with a shift of zero, one, or two. For all normalized in- 
puts with the floating-point format used by the TMS320C3x, a normalized re- 
sult can be produced by a shift of zero, one, or two. 

Example 4-4. Floating-Point Multiply Between Positive and Negative Numbers 

Let 

a = 1 .0 x 2«(e*p) = 01 .00000000000000000000000 x 2«(e*P) 
b = -2.0 x 2 b ( e *P) = 1 0.00000000000000000000000 x 2 b ( e *P) 

Then 

01.00000000000000000000000 x 2«(e*P) 
x 10.00000000000000000000000 x 2 b (e*p) 

1 1 1 0.0000000000000000000000000000000000000000000000 x 2 Mexp) +b(e*P)) 
The result is c = - 2.0 x 2(«(©*P) + b(exp)) 

Example 4-5. Floating-Point Multiply by Zero 

All multiplications by a floating-point zero yield a result of zero {f= 0, s = 0, and 
exp= -128). 
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4.5 Floating-Point Addition and Subtraction 



In floating-point addition and subtraction, two floating-point numbers a and b 
can be defined as 

a = a(man) x 2 < ex P) 
b = b{man) x 2 b(exp) 

The sum (or difference) of a and b can be defined as 

c = cc± b 

= (a{man) ± (b(man) x 2 -Mexp)-b(exp)))) x 2 «(e*P), 

if <x{exp) > b(exp) 
= ((a{man) x 2 -{b(exp)-a(exp))) + b(man)) x 2 b (e*P), 

if cc(exp) < b(exp) 

The flowchart for floating-point addition is shown in Figure 4-10. Since this 
flowchart assumes signed data, it is also appropriate for floating-point subtrac- 
tion. In this figure, it is assumed that a{exp)<b(exp). In step 1, the source expo- 
nents are compared, and c(exp) is set equal to the largest of the two source 
exponents. In step 2, d is set to the difference of the two exponents. In step 3, 
the mantissa with the smallest exponent, in this case a(man), is right-shifted 
d bits in order to align the mantissas. After the mantissas have been aligned, 
they are added (step 4). 

Steps 5 through 7 check for a special case of c{man). If c{man) is zero (step 
5), then c(exp) is set to its most negative value (step 8) to yield the correct re- 
presentation of zero. If c(man) has overflowed c (step 6), then in step 9, c(man) 
is right-shifted one bit, and one is added to c{exp). In step 1 0, the result is nor- 
malized. In steps 11 and 12, special cases of c(exp) are tested. If c{exp) has 
overflowed, then c is set to the most positive extended-precision value if it is 
positive; otherwise, it is set to the most negative extended-precision value. 
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Figure 4-10. Flowchart for Floating-Point Addition 

a(man) 



b(man) 



_SL 



(3) 



Align mantissas 



a(man) = a(man) > > d 

Discard LSBs to keep 
a(man) in 
extended-precision 
floating-point format 



(2) 



(4) 



a(exp) 



b(exp) 



Compare exponents 



I 



Subtract exponents 



d = b(exp) - a{exp) 



Add mantissas 



c [man) = a(mari) + b(man) 



(1) 



If a(exp) < = b(exp) 
c(exp) = b(exp) 

else 

c(exp) = oc(exp) 

[Assume for simplicity 

that a(exp) << = b(exp)] 



Test for special cases of c(man) 



(5) 
c(man) = 



(6) 
Overflow of c(man) 



(7) 
k = # leading 
non-significant 
sign bits 



(8) 



A. 



c(man) = c(man) > > 1 
c(exp) = c(exp) + 1 
Discard LSBs to keep in 
extended-precision 
floating-point format 



c(exp) =-128 



I 



(9) 



c(man) < < k 
c(exp) = c{exp)- 



S. 



I 



Test for special cases of c(exp) 



(11) 
c(exp) overflow 



(12) 
c(exp) underflow 



(13) 
c(exp) in range 



(14) 



If c(man) > 0, 
set c to most 
positive value. 
If c(man) < 0, 
set c to most 
negative value. 



i 



set c to zero 
c(exp) = -1 28 
c(man) = 



(15) 



w (16) 

Set c to final result | 



c = a + b 
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The following examples describe the floating-point addition and subtraction 
operations. It is assumed that the data is in the extended-precision 
floating-point format. 



Example 4-6. Floating-Point Addition 



In the case of two normalized numbers to be summed, let 

a = 1 .5 = 01 .1 000000000000000000000000000000 x 2° 
b = 0.5 = 01 .0000000000000000000000000000000 x 2" 1 

It is necessary to shift b to the right by one so that cc and b have the same expo- 
nent. This yields 

b = 0.5 = 00.1000000000000000000000000000000 x 2° 
Then 

01.10000000000000000000000000000000 x 20 



+ 



00.10000000000000000000000000000000 x 20 



010.00000000000000000000000000000000 x 20 

As in the case of multiplication, it is necessary to shift the binary point one place 
to the left and to add one to the exponent. This yields 

01.1000000000000000000000000000000 x 2° 
+ 00.1000000000000000000000000000000 x 2 ° 

01.0000000000000000000000000000000 x 2 1 

Example 4-7. Floating-Point Subtraction 

A subtraction is performed in this example. Let 

a = 01 .0000000000000000000000000000001 . x 20 
b = 01. 0000000000000000000000000000000 x 20 

The operation to be performed is a - b. The mantissas are already aligned be- 
cause the two numbers have the same exponent. The result is a large cancella- 
tion of the upper bits, as shown below. 

01.0000000000000000000000000000001 x 2° 
- 01.0000000000000000000000000000000 x 2 

00.0000000000000000000000000000001 x 2° 
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Floating-Point Addition and Subtraction 



The result must be normalized. In this case, a left-shift of 31 is required. The 
exponent of the result is modified accordingly. The result is 

01.0000000000000000000000000000001 x 2° 
- 01.0000000000000000000000000000000 x 2 ° 

01.0000000000000000000000000000000 x 2~ 31 



Example 4-8. Floating-Point Addition With a 32-Bit Shift 

This example illustrates a situation where a full 32-bit shift is necessary to nor- 
malize the result. Let 

a = 01.1111111111111111111111111111111 x 2127 
b = 10.0000000000000000000000000000000 x 2 127 

The operation to be performed is a + b. 

01.1111111111111111111111111111111 x 2 127 

+ 10.0000000000000000000000000000000 x 2 127 

11.1111111111111111111111111111111 x 2127 

Normalizing the result requires a left-shift of 32 and a subtraction of 32 from 
the exponent. The result is 

01.1111111111111111111111111111111 x 2127 
+ 10.0000000000000000000000000000000 x 2 127 
10.0000000000000000000000000000000 x 2 95 

Example 4-9. Floating-Point Addition/Subtraction and Zero 

When floating-point addition and subtraction are performed with a float- 
ing-point 0, the following identities are satisfied: 

a ± = a (a * 0) 

+ = 

-a = - a (a * 0) 
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4.6 Normalization Using the NORM Instruction 



The NORM instruction normalizes an extended-precision floating-point num- 
ber that is assumed to be unnormalized. Since the number is assumed to be 
unnormalized, no implied most significant nonsign bit is assumed. The NORM 
instruction executes the following three steps: 

1 ) Locates the most significant nonsign bit of the floating-point number. 

2) Left-shifts to normalize the number. 

3) Adjusts the exponent. 

Given the extended-precision floating-point value a to be normalized, the nor- 
malization, norm ( ), is performed as shown in Figure 4-11. 



Figure 4-11. Flowchart for NORM Instruction Operation 
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Set c to final result 
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Normalization Using the NORM Instruction 

Example 4-10. NORM Instruction 

Assume that an extended-precision register contains the value 
man = 00000000000000000001000000000001, exp = 

When the normalization is performed on a number assumed to be unnormal- 
ized, the binary point is assumed to be 

man = 0.0000000000000000001 000000000001 , exp = 

This number is then sign-extended one bit so that the mantissa contains 33 
bits. 

man = 00.0000000000000000001000000000001, exp = 

The intermediate result after the most significant nonsign bit is located and the 
shift performed is: 

man = 01 .000000000001 0000000000000000000, exp = -19 

The final 32-bit value output after removing the redundant bit is: 
man = 0000000000001 0000000000000000000, exp = -19 

The NORM instruction is useful for counting the number of leading zeros or 
leading ones in a 32-bit field. If the exponent is initially zero, the absolute value 
of the final value of the exponent is the number of leading ones or zeros. This 
instruction is also useful for manipulating unnormalized floating-point num- 
bers. 
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4.7 Rounding: The RND Instruction 



The RND instruction rounds a number from the extended-precision float- 
ing-point format to the single-precision floating-point format. Rounding is simi- 
lar to floating-point addition. Given the number a to be rounded, the following 
operation is performed first. 

c = a(man) x 2«(**P) + (1 x 2«( e *P)" 24 ) 

Next, a conversion from extended-precision floating-point to single-precision 
floating-point format is performed. Given the extended-precision floating-point 
value, the rounding, md( ), is performed as shown in Figure 4-12. 
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Rounding: The RND Instruction 



Figure 4-12. Flowchart for Floating-Point Rounding by the RND Instruction 

a 1 X2 
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c (exp) overflow 



c (exp) in range 



JE. 



Ifc (man) > 0, 
set c to most positive 
single-precision value. 
Ifc (man) < 0, 
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4.8 Floating-Point to Integer Conversion 



Floating-point to integer conversion, using the FIX instructions, allows exten- 
ded-precision floating-point numbers to be converted to single-precision inte- 
gers in a single cycle. The floating-point to integer conversion of the value x 
is referred to here as fix(x). The conversion does not overflow if a, the number 
to be converted, is in the range 

- 23" 1 < a < 23" 1 - 1 
First, you must be certain that 
a(exp) < 30 

If these bounds are not met, an overflow occurs. If an overflow occurs in the 
positive direction, the output is the most positive integer. If an overflow occurs 
in the negative direction, the output is the most negative integer. If oc(exp) is 
within the valid range, then a{man), with implied bit included, is sign-extended 
and right-shifted (rs) by the amount 

rs = 31 - a(exp) 

This right-shift (rs) shifts out those bits corresponding to the fractional part of 
the mantissa. For example: 

If < x < 1 , then fix(x) = 0. 
If -1 < x < 0, then fix(x) = -1 . 

The flowchart for the floating-point to integer conversion is shown in 
Figure 4-13. 
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Floating-Point to Integer Conversion 



Figure 4-13. Flowchart for Floating-Point to Integer Conversion by FIX Instructions 
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4.9 Integer to Floating-Point Conversion Using the FLOAT Instruction 

Integer to floating-point conversion, using the FLOAT instruction, allows sing- 
le-precision integers to be converted to extended-precision floating-point num- 
bers. The flowchart for this conversion is shown in Figure 4-14. 

Figure 4-14. Flowchart for Integer to Floating-Point Conversion by FLOAT Instructions 
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Chapter 5 

Addressing 



The TMS320C3x supports five groups of powerful addressing modes. Six 
types of addressing may be used within the groups, which allow access of data 
from memory, registers, and the instruction word. This chapter details the oper- 
ation, encoding, and implementation of the addressing modes. It also dis- 
cusses the management of system stacks, queues, and deques in memory. 
These are the major topics in this chapter: 

□i Types of Addressing (Section 5.1 on page 5-2) 

Q Register 

B Direct 

B Indirect 

■ Short-immediate 

■ Long-immediate 

■ PC-relative 

Qi Groups of Addressing Modes (Section 5.2 on page 5-19) 

B General addressing modes 

n Three-operand addressing modes 

b Parallel addressing modes 

B Long-immediate addressing mode 

B Conditional-branch addressing modes 
Qi Circular Addressing (Section 5.3 on page 5-24) 
Qi Bit-Reversed Addressing (Section 5.4 on page 5-29) 
Qi System Stack Management (Section 5.5 on page 5-30) 
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5.1 Types of Addressing 

Six types of addressing allow access of data from memory, registers, and the 
instruction word: 

■ Register 

■ Direct 

■ Indirect 

■ Short-immediate 

■ Long-immediate 

■ PC-relative 

Some types of addressing are appropriate for some instructions and not oth- 
ers. For this reason, the types of addressing are used in the five different 
groups of addressing modes as follows: 

Q General addressing modes (G): 

■ Register 

■ Direct 

■ Indirect 

n Short-immediate 

Q Three-operand addressing modes (T): 

■ Register 

■ Indirect 

Q Parallel addressing modes (P): 
H Register 

■ Indirect 

ti Long-immediate addressing mode 

■ Long-immediate 

□ Conditional-branch addressing modes (B): 

■ Register 

n PC-relative 

The six types of addressing are discussed first, followed by the five groups of 
addressing modes. 
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5.1.1 Register Addressing 



In register addressing, a CPU register contains the operand, as shown in this 
example: 



ABSF 



Rl 



Rl = | Rl | 



The syntax for the CPU registers, the assembler syntax, and the assigned 
function for those registers are listed in Table 5-1 . 



Table 5-1. CPU Register/Assembler Syntax and Function 



CPU Register 
Address 


Assembler 
Syntax 


Assigned 
Function 


OOh 
01 h 
02h 
03h 
04h 
05h 
06h 
07h 


RO 
R1 
R2 
R3 
R4 
R5 
R6 
R7 


Extended-precision register 
Extended-precision register 
Extended-precision register 
Extended-precision register 
Extended-precision register 
Extended-precision register 
Extended-precision register 
Extended-precision register 


08h 
09h 
OAh 
OBh 
OCh 
ODh 
OEh 
OFH 


ARO 
AR1 
AR2 
AR3 
AR4 
AR5 
AR6 
AR7 


Auxiliary register 
Auxiliary register 
Auxiliary register 
Auxiliary register 
Auxiliary register 
Auxiliary register 
Auxiliary register 
Auxiliary register 


10h 
11h 
12h 
13h 

14h 


DP 
IRO 
IR1 
BK 
SP 


Data-page pointer 
Index register 
Index register 1 
Block-size register 
Active stack pointer 


15h 
16h 
17h 
18h 


ST 
IE 
IF 
IOF 


Status register 
CPU/DMA interrupt enable 
CPU interrupt flags 
I/O flags 


19h 
1Ah 
1Bh 


RS 
RE 
RC 


Repeat start address 
Repeat end address 
Repeat counter 
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5.1.2 Direct Addressing 



In direct addressing, the data address is formed by the concatenation of the 
eight least significant bits of the data page pointer (DP) with the 1 6 least signifi- 
cant bits of the instruction word (expr). This results in 256 pages (64K words 
per page), giving the programmer a large address space without requiring a 
change of the page pointer. The syntax and operation for direct addressing are 
listed below. 

Syntax: @expr 

Operation: address = DP concatenated with expr 

Figure 5-1 shows the formation of the data address. Example 5-1 gives an in- 
struction example with data before and after instruction execution. 



Figure 5-1. Direct Addressing 
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Example 5-1. Direct Addressing 

ADDI @0BCDEh,R7 

Before Instruction: 

DP = 8Ah 

R7 = 0h 

Data at 8ABCDEh = 12345678h 



After Instruction: 

DP = 8Ah 

R7 = 12345678h 

Data at 8ABCDEh = 12345678h 
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5.1.3 Indirect Addressing 



Indirect addressing is used to specify the address of an operand in memory 
through the contents of an auxiliary register, optional displacements, and index 
registers. Only the 24 least significant bits of the auxiliary registers and index 
registers are used in indirect addressing. This arithmetic is performed by the 
auxiliary register arithmetic units (ARAUs) on these lower 24 bits and is un- 
signed. The upper eight bits are unmodified. 

The flexibility of indirect addressing is possible because the ARAUs on the 
TMS320C3x are used to modify auxiliary registers in parallel with operations 
within the main CPU. Indirect addressing is specified by a five-bit field in the 
instruction word, referred to as the mod field. A displacement is either an ex- 
plicit unsigned 8-bit integer contained in the instruction word or an implicit dis- 
placement of one. Two index registers, IRO and IR1 , can also be used in indi- 
rect addressing. In some cases, an addressing scheme using circular or bit- 
reversed addressing is optional. The mechanism for generating addresses in 
circular addressing is discussed in Section 5.3, bit-reversed in Section 5.4. 

Table 5-2 lists the various kinds of indirect addressing, along with the value of 
the modification (mod) field, assembler syntax, operation, and function for 
each. The succeeding 1 8 examples show the operation for each kind of indirect 
addressing. 
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Table 5-2. Indirect Addressing 



Mod Field 


Syntax 


Operation 


Description 


Indirect Addressing with Displacement 


00000 


*+ARn(disp) 


addr = ARn + disp 


With predisplacement add ' 


00001 


*- ARn(disp) 


addr = ARn - disp 


With predisplacement subtract 


00010 


*++ARn(disp) 


addr = ARn + disp 
ARn = ARn + disp 


With predisplacement add and modify 


00011 


*- - ARn(disp) 


addr = ARn - disp 
ARn = ARn - disp 


With predisplacement subtract and modify 


00100 


*ARn++(disp) 


addr = ARn 
ARn = ARn + disp 


With postdisplacement add and modify 


00101 


*ARn — (disp) 


addr = ARn 
ARn = ARn - disp 


With postdisplacement subtract and modify 


00110 


*ARn++(disp)% 


addr = ARn 

ARn = circ(ARn + disp) 


With postdisplacement add and circular 
modify 


00111 


*ARn - - (disp)% 


add = ARn 

ARn = circ(ARn - disp) 


With postdisplacement subtract and 
circular modify 


Indirect Addressing with Index Register IRO 


01000 


*+ARn(IR0) 


addr = ARn + IRO 


With preindex (IRO) add 


01001 


*-ARn(IR0) 


addr = ARn -IRO 


With preindex (IRO) subtract 


01010 


*++ARn(IR0) 


addr = ARn + IRO 
ARn = ARn + IRO 


With preindex (IRO) add and modify 


01011 


* — ARn(IRO) 


addr = ARn -IRO 
ARn = ARn -IRO 


With preindex (IRO) subtract and modify 


01100 


*ARn++(IR0) 


addr = ARn 
ARn = ARn + IRO 


With postindex (IRO) add and modify 


01101 


*ARn — (IRO) 


addr= ARn 
ARn = ARn -IRO 


With postindex (IRO) subtract and modify 


01110 


*ARn++(IR0)% 


addr = ARn 

ARn = circ(ARn + IRO) 


With postindex (IRO) add and circular 
modify 


01111 


*ARn — (IR0)% 


addr = ARn 

ARn = circ(ARn) - IRO 


With postindex (IRO) subtract and circular 
modify 



LEGEND: 




addr 


= memory address 


ARn 


= auxiliary register ARO - AR7 


IRn 


= index register IRO or IR1 


disp 


= displacement 


++ 


= add and modify 


-- 


= subtract and modify 


circ( ) 


= address in circular addressing 


% 


= where circular addressing is performed 
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Table 5-2. Indirect Addressing (Concluded) 



Mod Field Syntax Operation 


Description 


indirect Addressing with Index Register IR1 


10000 


*+ARn(IR1) 


addr = ARn + IR1 


With preindex(IRI) add 


10001 


*-ARn(IR1) 


addr = ARn - IR1 


With preindex (IR1) subtract 


10010 


*++ARn(IR1) 


addr = ARn + IR1 
ARn = ARn + IR1 


With preindex(IRI) add 
and modify 


10011 


* — ARn(IR1) 


addr = ARn-IR1 
ARn = ARn-IR1 


With preindex (IR1) subtract 
and modify 


10100 


*ARn++(IR1) 


addr = ARn 
ARn=ARn + IR1 


With postindex (IR1) add 
and modify 


10101 


*ARn — (IR1) 


addr = ARn 
ARn = ARn-IR1 


With postindex (IR1) subtract 
and modify 


10110 


*ARn++(IR1)% 


addr = ARn 

ARn = circ(ARn + IR1) 


With postindex (IR1) add 
and circular modify 


10111 


*ARn — (IR1)% 


addr = ARn 

ARn = circ(ARn-IR1) 


With postindex (IR1) subtract 
and circular modify 


Indirect Addressing (Special Cases) 


11000 


*ARn 


addr = ARn 


Indirect 


11001 


*ARn ++ (IRO)B 


addr = ARn 

ARn = B(ARn + IRO) 


With postindex (IRO) add 
and bit-reversed modify 



LEGEND: 






addr 


= 


memory address 


ARn 


= 


auxiliary register AR0 - AR7 


IRn 


= 


index register IRO or IR1 


disp 


= 


displacement 


++ 


= 


add and modify 


-- 


= 


subtract and modify 


circ( ) 


= 


address in circular addressing 


% 


= 


where circular addressing is performed 


B 


= 


where bit-reversed addressing is performed 
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Example 5-2. Auxiliary Register Indirect 



An auxiliary register (ARn) contains the address of the operand to be fetched. 
Operation: operand address = ARn 

Assembler Syntax: *ARn 

Modification Field: 11000 



ARn 



31 


24 


23 









X 


X 


address 


31 






ir 





operand 



Example 5-3. Indirect With Predisplacement Add 



The address of the operand to be fetched is the sum of an auxiliary register 
(ARn) and the displacement (disp). The displacement is either an eight-bit un- 
signed integer contained in the instruction word or an implied value of 1 . 

Operation: operand address = ARn+ disp 

Assembler Syntax: *+ARn(disp) 

Modification Field: 00000 



31 



ARn 



24 23 



address 



31 



disp 



0...0 



8 7 



integer 



> (+) 



31 



operand 
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Example 5-4. Indirect With Predisplacement Subtract 



The address of the operand to be fetched is the contents of an auxiliary register 
(ARn) minus the displacement (disp). The displacement is either an eight-bit 
unsigned integer contained in the instruction word or an implied value of 1 . 

Operation: operand address = ARn- disp 

Assembler Syntax: *- AR n(disp) 

Modification Field: 00001 



31 



ARn 



31 



24 23 



X X 


address 





31 




8 


7 




disp 





0...0 





integer 


tk 





operand 



(-) 



Lj> 



Example 5-5. Indirect With Predisplacement Add and Modify 



The address of the operand to be fetched is the sum of an auxiliary register 
(ARn) and the displacement (disp). The displacement is either an eight-bit un- 
signed integer contained in the instruction word or an implied value of 1 . After 
the data is fetched, the auxiliary register is updated with the address gener- 
ated. 



Operation: 

Assembler Syntax: 
Modification Field: 



operand address = AR/7+ disp 
ARn = ARn + disp 
*++ARn(disp) 
00010 



31 



ARn 



24 23 



address 



31 



disp 



0...0 



31 



8 7 



integer 



(+) 



operand 
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Example 5-6. Indirect With Predisplacement Subtract and Modify 



The address of the operand to be fetched is the contents of an auxiliary register 
(ARn) minus the displacement (disp). The displacement is either an eight-bit 
unsigned integer contained in the instruction word or an implied value of 1 . Af- 
ter the data is fetched, the auxiliary register is updated with the address gener- 
ated. 



Operation: 

Assembler Syntax: 
Modification Field: 



operand address = ARa?- disp 
ARn= AR/7+ disp 

*--ARn(disp) 
00011 



31 



ARn 



31 



disp 



0...0 



31 



24 23 



address 



8 7 



integer 



~E 



(-) 



operand 



Example 5-7. Indirect With Postdisplacement Add and Modify 



The address of the operand to be fetched is the contents of an auxiliary register 
(ARn). After the operand is fetched, the displacement (disp) is added to the 
auxiliary register. The displacement is either an eight-bit unsigned integer con- 
tained in the instruction word or an implied value of 1 . 

Operation: operand address = ARn 

ARn= ARm-disp 

Assembler Syntax: *ARr?++ (disp) 

Modification Field: 00100 



31 



ARn 



24 23 



address 



31 



disp 



0...0 



integer 



(+) 



31 



operand 
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Example 5-8. Indirect With Postdisplacement Subtract and Modify 



The address of the operand to be fetched is the contents of an auxiliary register 
(ARn). After the operand is fetched, the displacement (disp) is subtracted from 
the auxiliary register. The displacement is either an eight-bit unsigned integer 
contained in the instruction word or an implied value of 1 . 
Operation: operand address = ARn 

ARn = AR/7- disp 

Assembler Syntax: *ARn — (disp) 

Modification Field: 00101 



31 



ARn 



31 



disp 



0...0 



24 23 



address 



8 7 



integer 



5rJ_> (- 



) <H» 



31 



SL 



operand 



Example 5-9. Indirect With Postdisplacement Add and Circular Modify 



The address of the operand to be fetched is the contents of an auxiliary register 
(ARn). After the operand is fetched, the displacement (disp) is added to the 
contents of the auxiliary register using circular addressing. This result is used 
to update the auxiliary register. The displacement is either an eight-bit un- 
signed integer contained in the instruction word or an implied value of 1 . 
Operation: operand address = ARn 

ARn = circ(ARn+ disp) 

Assembler Syntax: *ARn++ (disp)% 

Modification Field: 00110 



31 



ARn 



31 



disp 



0...0 



31 



24 23 



8 7 



address 



teger I — > 



integer 



T" 

(%) 
i 

(+) *~? 



_2_ 



operand 
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Example 5-10. Indirect With Postdisplacement Subtract and Circular Modify 



The address of the operand to be fetched is the contents of an auxiliary register 
(ARn). After the operand is fetched, the displacement (disp) is subtracted from 
the contents of the auxiliary register using circular addressing. This result is 
used to update the auxiliary register. The displacement is either an eight-bit un- 
signed integer contained in the instruction word or an implied value of 1 . 

Operation: operand address = ARn 

ARn= circ(AR/7- disp) 

Assembler Syntax: *ARn--(disp)% 

Modification Field: 00111 



31 



disp 



24 23 





ARn . h 


X X 


address 






31 






8 7 




t 
(%) 







0...0 





integer 










31 













operand 



Example 5-11. Indirect With Preindex Add 



The address of the operand to be fetched is the sum of an auxiliary register 
(ARn) and an index register (IRO or IR1). 

Operation: operand address = AR/7+ \Rm 

Assembler Syntax: *+ ARn(IRm) 



Modification Field: 



01000 ifm = 
10000 if m = 1 



31 



ARn 



IRm 



31 



24 23 



address 



31 


24 


23 









X 


X 


index 


w 


W 



> (+) 



operand 
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Example 5- 12. Indirect With Preindex Subtract 



The address of the operand to be fetched is the difference of an auxiliary regis- 
ter (ARn) and an index register (IRO or IR1). 

Operation: operand address = ARn- IR m 

Assembler Syntax: *-AR/?(IRm) 

Modification Field: 01001 ifm = 

10001 if m=1 



31 



ARn 



31 



IRm— $ 



24 23 



31 



24 23 



address 



index 
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Example 5-13. Indirect With Preindex Add and Modify 



The address of the operand to be fetched is the sum of an auxiliary register 
(ARn) and an index register (IRO or 1R1 ). After the data is fetched, the auxiliary 
register is updated with the address generated. 

Operation: operand address = AR/7+ IR m 

ARn= AR/7+ IRm 



Assembler Syntax: 
Modification Field: 



*++ARn(IRm) 

01010 ifm = 
10010 if m =1 
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Example 5-14. Indirect With Preindex Subtract and Modify 



The address of the operand to be fetched is the difference of an auxiliary regis- 
ter (ARn) and an index register (IRO or IR1). The resulting address becomes 
the new contents of the auxiliary register. 

Operation: operand address = ARn- IRm 

ARn= ARn- IRm 

Assembler Syntax: *--ARn(IRm) 

Modification Field: 01011 ifm = 

10011 if m =1 
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Example 5-15. Indirect With Postindex Add and Modify 



The address of the operand to be fetched is the contents of an auxiliary register 
(ARn). After the operand is fetched, the index register (IRO or IR1 ) is added to 
the auxiliary register. 

Operation: operand address = ARn 

AR/7=ARn+ IRm 



Assembler Syntax: 
Modification Field: 



*ARn++(IRm) 

01100 ifm = 
10100 if m = 1 



ARn 
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Example 5-16. Indirect With Postindex Subtract and Modify 



The address of the operand to be fetched is the contents of an auxiliary register 
(ARn). After the operand is fetched, the index register (IRO or IR1 ) is subtracted 
from the auxiliary register. 

Operation: operand address = ARn 

ARn= ARn -IRm 



Assembler Syntax: 
Modification Field: 



*ARn--(IRm) 

01101 ifm = 
10101 if m =1 



31 



ARn 



31 



IRm- 



fc x 



* x 
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Example 5- 1 7. Indirect With Postindex Add and Circular Modify 



The address of the operand to be fetched is the contents of an auxiliary register 
(ARn). After the operand is fetched, the index register (IRO or IR1 ) is added to 
the auxiliary register. This value is evaluated using circular addressing and re- 
places the contents of the auxiliary register. 

Operation: operand address = ARn 

ARn = circ(ARn+ IRm) 

Assembler Syntax: *ARn++ (IRm)% 

Modification Field: 01110 if m = 

10110 if m=1 
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Example 5-18. Indirect With Postindex Subtract and Circular Modify 



The address of the operand to be fetched is the contents of an auxiliary register 
(ARn). After the operand is fetched, the index register (IRO or IR1 ) is subtracted 
from the auxiliary register. This result is evaluated using circular addressing 
and replaces the contents of the auxiliary register. 

Operation: operand address = ARn 

ARn= circ(ARn-IRm) 

Assembler Syntax: *ARn-- (IRm)% 



Modification Field: 



01111 ifm= 

10111 if m =1 
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Example 5- 19. Indirect With Postindex Add and Bit-Reversed Modify 



The address of the operand to be fetched is the contents of an auxiliary register 
(ARn). After the operand is fetched, the index register (IRO) is added to the aux- 
iliary register. This addition is performed with a reverse-carry propagation and 
can be used to yield a bit-reversed (B) address. This value replaces the con- 
tents of the auxiliary register. 

Operation: operand address = ARn 

ARn=B(ARn+IR0) 

Assembler Syntax: *ARn++(IR0)B 

Modification Field: 11001 
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5.1.4 Short-Immediate Addressing 

In short-immediate addressing, the operand is a 16-bit immediate value con- 
tained in the 1 6 least significant bits of the instruction word (expr). Depending 
upon the data types assumed for the instruction, the short-immediate operand 
may be a twos-complement integer, an unsigned integer, or a floating-point 
number. This is the syntax for this mode: 

Syntax: expr 

Example 5-20 gives an instruction example with before- and after-instruction 
data. 

Example 5-20. Short-Immediate Addressing 

SUBI 1,R0 

Before Instruction: After Instruction: 

R0 = 0h R0 = OFFFFFFFFh 



5.1.5 Long-Immediate Addressing 

In long-immediate addressing, the operand is a 24-bit immediate value con- 
tained in the 24 least significant bits of the instruction word (expr). This is the 
syntax for this mode: 

Syntax: expr 

Example 5-21 gives an instruction example with before- and after-instruction 
data. 

Example 5-21. Long-Immediate Addressing 

BR 8000h 

Before Instruction: After Instruction: 

PC = Oh PC = 8000h 
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5.1.6 PC-Relative Addressing 

PC-relative addressing is used for branching. The assembler takes the src(a 
label or address) specified by the user and generates a displacement. If the 
branch is a standard branch, this displacement is equal to the label - (PC +1 ). 
If the branch is a delayed branch, this displacement is equal to the la- 
bel -(PC +3). 

The displacement is stored as a 1 6-bit signed integer in the least significant 
bits of the instruction word. This displacement is added to the PC if the condi- 
tion is true. 

Syntax: expr 

. Example 5-22 gives an instruction example with before- and after-instruction 
data. 

Example 5-22. PC-Relative Addressing 

BU NEWPC ; pc=1001h,NEWPC= 1005h,displacement= 3 

Before Instruction: After Instruction: 

PC = 1001h PC = 1005h 
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5.2 Groups of Addressing Modes 



Six types of addressing (covered in Section 5.1 , beginning on page 5-2) form 
these five groups of addressing modes: 

Qi General addressing modes (G) 

Q Three-operand addressing modes (T) 

□i Parallel addressing modes (P) 

Qi Long-immediate addressing mode 

Q Conditional-branch addressing modes (B) 



5.2.1 General Addressing Modes 



Instructions that use the general addressing modes are general-purpose in- 
structions, such as ADDI, MPYF, and LSH. Such instructions usually have this 
form: 

dst operation src -> dst 

where the destination operand is signified by dst and the source operand by 
src; operation defines an operation to be performed using the general ad- 
dressing modes to specify certain operands. Bits 31 — 29 are zero, indicating 
general addressing mode instructions. Bits 22 and 21 specify the general ad- 
dressing mode (G) field, which defines how bits 15 through are to be inter- 
preted for addressing the src operand. 

Options for bits 22 and 21 (G field) are as follows: 

register (all CPU registers unless specified otherwise) 

1 direct 

1 indirect 
1 1 immediate 

If the src and dsf fields contain register specif ications, the value in these fields 
contains the CPU register addresses as defined by Table 5-1 . For the general 
addressing modes, the following values of ARn are valid: 

ARn, < n < 7 

Figure 5-2 shows the encoding for the general addressing modes. The nota- 
tion modn indicates the modification field that goes with the ARn field. Refer 
to Table 5-2 for further information. 
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Figure 5-2. Encoding for General Addressing Modes 
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5.2.2 Three-Operand Addressing Modes 



Instructions that use the three-operand addressing modes, such as 
ADDI3, LSH3, CMPF3. or XOR3, usually have this form: 

SRC1 operation SRC2 -» dst 

where the destination operand is signified by dst and the source operands by 
SRC1 and SRC2; operation defines an operation to be performed. Note that 
the 3 can be omitted from three-operand instructions. 

Bits 31-29 are set to the value of 001 , indicating three-operand addressing 
mode instructions. Bits 22 and 21 specify the three-operand addressing mode 
(T) field, which defines how bits 1 5 — are to be interpreted for addressing the 
SRCoperands.Bits15 — 8areusedtodefinetheSRC1 address, and bits 7—0 
to define the SRC2 address. Options for bits 22 and 21 (T) are as follows: 



SRC1 



SRC2 






register 


register 


1 


indirect 


register 


1 


register 


indirect 


1 1 


indirect 


indirect 



Figure 5-3 shows the encoding for three-operand addressing. If the SRC1 
and SRC2 fields use the same auxiliary register, both addresses are correctly 
generated. However, only the value created by the SRC1 field is saved in the 
auxiliary register specified. The assembler issues a warning if this condition 
is specified by the user. 

The following values of ARn and ARm are valid: 

ARn,0<n< 7 
ARm,0<m<7 

The notation modmor modn indicates the modification field goes with the ARm 
or ARn field, respectively. Refer to Table 5-2 for further information. 
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In indirect addressing of the three-operand addressing mode, displacements 
(if used) are allowed to be or 1 , and the index registers (IRO and IR1 ) can be 
used. The displacement of 1 is implied and is not explicitly coded in the instruc- 
tion word. 



Figure 5-3. Encoding for Three-Operand Addressing Modes 
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5.2.3 Parallel Addressing Modes 



Instructions that use parallel addressing, indicated by || (two vertical bars), al- 
low for the greatest amount of parallelism possible. The destination operands 
are indicated as d1 and 62, signifying dstl and dst2, respectively (see Figure 
6-4). The source operands, signified by srd and src2, use the extended-pre- 
cision registers. The parallel operation to be performed is called operation. 



Figure 5-4. Encoding for Parallel Addressing Modes 
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The parallel addressing mode (P) field specifies how the operands are to be 
used, i.e., whether they are source or destination. The specific relationship be- 
tween the P field and the operands is detailed in the description of the individual 
parallel instructions (see Chapter 1 0). However, the operands are always en- 
coded in the same way. Bits 31 and 30 are set to the value of 1 0, indicating par- 
allel addressing mode instructions. Bits 25 and 24 specify the parallel address- 
ing mode (P) field, which defines how bits 21 — are to be interpreted for ad- 
dressing the src operands. Bits 21 — 19 are used to define the srd address, 
bits 18 — 1 6 to define the src2 address, bits 15 — 8 the src3 address, and bits 
7 — Othe src 4 address. The notations modn and modm indicate which modifi- 
cation field goes with which ARn or ARm (auxiliary register) field, respectively. 
The parallel addressing operands are listed below. 
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srd < srd < 7 (extended-precision registers RO — R7) 

src2 < src2 < 7 (extended-precision registers RO — R7) 

d1 If O.cfeff is RO. If 1, cfsff is R1. 

d2 If 0, dst2 is R2. If 1 , dst2 is R3. 

P 0< P<3 

src3 indirect (disp = 0, 1, IRO, IR1) 

srcA indirect (disp = 0, 1, IRO, IR1) 

As in the three-operand addressing mode, indirect addressing in the parallel 
addressing mode allows for displacements of or 1 and the use of the index 
registers (IRO and IR1). The displacement of 1 is implied and is not explicitly 
coded in the instruction word. 

In the encoding shown for this mode in Figure 5-4, if the src3 and src4 fields 
use the same auxiliary register, both addresses are correctly generated, but 
only the value created by the src3 field is saved in the auxiliary register speci- 
fied. The assembler issues a warning if this condition is specified by the user. 



5.2.4 Long-Immediate Addressing Mode 



The long-immediate addressing mode is used to encode the program control 
instructions (BR, BRD, and CALL); for this, it is useful to have a 24-bit absolute 
address contained in the instruction word. The unconditional branches, BR 
(standard) and BRD (delayed), use the long-immediate addressing mode. Bits 
31 — 26are set to the value of 011 000, indicating long-immediate addressing 
mode instructions. Selection of bit 24 determines the type of branch: D = for 
a standard branch or D = 1 for a delayed branch. The long-immediate operand 
is the 24-bit src. These instructions are encoded as shown in Figure 5-5. 
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Figure 5-5. Encoding for Long-Immediate Addressing Mode 
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5.2.5 Conditional-Branch Addressing Modes 



Instructions using the conditional-branch addressing modes (Bcond, BcondD, 
CALLcond, DBcond, and DBcondD) can perform a variety of conditional oper- 
ations. Bits 31 — 27 are setto the value of 011 01 , indicating conditional-branch 
addressing mode instructions. Bit 26 is setto or 1 ; the former selects DBcond, 
the latter Bcond. Selection of bit 25 determines the conditional-branch ad- 
dressing mode (B). If B = 0, register addressing is used; if B = 1 , PC-relative 
addressing is used. Selection of bit 21 sets the type of branch: D = for a stan- 
dard branch or D = 1 for a delayed branch. The condition field(cono) specifies 
the condition checked to determine what action to take, i.e., whether or not to 
branch (see Chapter 1 1 for a list of condition codes). Figure 6-6 shows the en- 
coding for conditional-branch addressing. 



Figure 5-6. Encoding for Conditional-Branch Addressing Modes 
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5.3 Circular Addressing 



Many algorithms, such as convolution and correlation, require the implementa- 
tion of a circular buffer in memory. In convolution and correlation, the circular 
buffer is used to implement a sliding window that contains the most recent data 
to be processed. As new data is brought in, the new data overwrites the oldest 
data. Key to the implementation of a circular buffer is the implementation of a 
circular addressing mode. This section describes the circular addressing mode 
of the TMS320C3X. 

The blocksize register (BK) specifies the size of the circular buffer. The bottom 
of the circular buffer is specified by the first bit (counting from the most signifi- 
cant bit to the least significant bit) in the lower 1 6 bits of the BK register, plus 
a user-selected auxiliary register (ARn). With the location of the first 1 bit speci- 
fied as bit N, the address at the top of the buffer is referred to as the effective 
base (EB) and is equal to bits 31 through (N+1) of ARn with bits N-1 through 
of EB being zero and bit N being one. 

Figure 5-7 illustrates the relationships between the blocksize register (BK), 
the auxiliary registers (ARn), the bottom of the circular buffer, the top of the cir- 
cular buffer, and the index into the circular buffer. 

A circular buffer of size R must start on an N-bit boundary (i.e., N LSBs of ad- 
dress are 0) where N is the smallest integer that satisfies 2N> R. Also, the value 
R must be loaded into the BK register. For example, a 31 -word circular buffer 
must start at an address whose 5 LSBs are (i.e., 
XXXXXXXXXXXXXXXXXXOOOOOOb) and 31 must be loaded into BK. 
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Figure 5-7. Flowchart for Circular Addressing 
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In circular addressing, index refers to the N LSBs of the auxiliary register se- 
lected, and step is the quantity being added to or subtracted from the auxiliary 
register. Follow these two rules when you use circular addressing: 

Qi The step used must be less than or equal to the blocksize. 

Qi The first time the circular queue is addressed, the auxiliary register must 
be pointing to an element in the circular queue. 

The algorithm for circular addressing is as follows: 

If < index + step < BK: 

index = index + step. 

Else if index + step > BK: 

index = index + step - BK. 

Else if index + step < 0: 

index = index + step + BK. 

Figure 5-8 shows how the circular buffer is implemented. It illustrates the rela- 
tionship of the quantities generated and the elements in the circular buffer. 



Figure 5-8. Circular Buffer Implementation 
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Figure 5-9 gives an example of the operation of circular addressing. Assum- 
ing that all ARs are four bits, let ARO = 0000,andBK= 0110 (blocksize of 6). 
This example shows a sequence of modifications and the resulting value of 
ARO. It also shows how the pointer steps through the circular queue with a vari- 
ety of step sizes (both incrementing and decrementing). 



Figure 5-9. Circular Addressing Example 
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Circular addressing is especially useful for the implementation of FIR filters. 
Figure 5-1 shows one possible data structure for FIR filters. Note that the ini- 
tial value of ARO points to h(N -1), and the initial value of AR1 points to x(0). 
Circular addressing is used in the TMS320C3x code for the FIR filter shown 
in Figure 5-11. 



Figure 5-10. Data Structure for FIR Filters 
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Figure 5-11. FIR Filter Code Using Circular Addressing 
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5.4 Bit-Reversed Addressing 



Bit-reversed addressing on the TMS320C3x enhances execution speed and 
program memory for FFT algorithms that use a variety of radices. The base 
address of bit-reversed addressing must be located on a boundary of the size 
of the table. For example, if IRO = 2 n_1 , the n LSBs of the base address must 
be zero. The base address of the data in memory must be on a 2 n boundary. 
One auxiliary register points to the physical location of a data value. IRO speci- 
fies one-half the size of the FFT; e.g., the value contained in IRO must be equal 
to 2 n_1 where n is an integer and the FFT size is 2 n . When you add IRO to 
the auxiliary register by using bit-reversed addressing, addresses are gener- 
ated in a bit-reversed fashion. 

To illustrate this kind of addressing, assume eight-bit auxiliary registers. Let 
AR2 contain the value 0110 0000 (96). This is the base address of the data in 
memory. Let IRO contain the value 0000 1000 (8). Figure 5-12 shows a se- 
quence of modifications of AR2 and the resulting values of AR2. 



Figure 5- 12. Bit-Reversed Addressing Example 
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Table 5-3 shows the relationship of the index steps and the four LSBs of AR2. 
As you can see, you can find the four LSBs by reversing the bit pattern of the 
steps. 



Table 5-3. Index Steps and Bit-Reversed Addressing 



Step 


Bit Pattern 


Bit-Reversed Pattern 


Bit-Reversed Step 





0000 


0000 





1 


0001 


1000 


8 


2 


0010 


0100 


4 


3 


0011 


1100 


12 


4 


0100 


0010 


2 


5 


0101 


1010 


10 


6 


0110 


0110 


6 


7 


0111 


1110 


14 


8 


1000 


0001 


1 


9 


1001 


1001 


9 


10 


1010 


0101 


5 


11 


1011 


1101 


13 


12 


1100 


0011 


3 


13 


1101 


1011 


11 


14 


1110 


0111 


7 


15 


1111 


1111 


15 



5-29 



System and User Stack Management 



5.5 System and User Stack Management 



The TMS320C3x provides a dedicated system stack pointer (SP) for building 
stacks in memory. The auxiliary registers can also be used to build a variety 
of more general linear lists. This section discusses the implementation of the 
following types of linear lists: 

Stack A linear list for which all insertions and deletions are made at one 
end of the list. 

Queue A linear list for which all insertions are made at one end of the list, 
and all deletions are made at the other end. 

Deque A double-ended queue linear list for which insertions and deletions 
are made at either end of the list. 

The system stack pointer (SP) is a 32-bit register that contains the address of 
the top of the system stack. The system stack fills from low-memory address 
to high-memory address (see Figure 5-13). The SP always points to the last 
element pushed onto the stack. A push performs a preincrement, and a pop 
performs a postdecrement of the system stack pointer. 

The program counter is pushed onto the system stack on subroutine calls, 
traps, and interrupts. It is popped from the system stack on returns. The system 
stack can be pushed and popped using the PUSH, POP, PUSHF, and POPF 
instructions. 



Figure 5-13. System Stack Configuration 



Low Memory 
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Top of Stack 



(Free) 
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Stacks can be built from low to high memory or high to low memory. Two cases 
for each type of stack are shown. Stacks can be built using the preincrement/ 
decrement and postincrement/decrement modes of modifying the auxiliary 
registers (AR). Stack growth from high-to-low memory can be implemented in 
two ways: 

CASE 1 : Stores to memory using * — ARn to push data onto the stack and 
reads from memory using *ARn ++ to pop data off the stack. 

CASE 2: Stores to memory using *ARn to push data onto the stack and 

reads from memory using * ++ ARn to pop data off the stack. 

Figure 5-14 illustrates these two cases. The only difference is that in case 1 , 
the AR always points to the top of the stack, and in case 2, the AR always points 
to the next free location on the stack. 



Figure 5- 14. Implementations of High-to-Low Memory Stacks 



ARn -> 



Case 1 
Low Memory 




Case 2 
Low Memory 


(Free) 


ARn -» 


(Free) 


Top of Stack 


Top of Stack 






Bottom of Stack 


Bottom of Stack 


High Memory 




High Memory 



Stack growth from low-to-high memory can be implemented in two ways: 

CASE 3: Stores to memory using *++ ARn to push data onto the stack and 
reads from memory using *ARn to pop data off the stack. 

CASE 4: Stores to memory using *ARn ++ to push data onto the stack and 
reads from memory using * ARn to pop data off the stack. 

Figure 5-1 5 shows these two cases. In the case 3, the AR always points to the 
top of the stack. In case 4, the AR always points to the next free location on 
the stack. 
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Figure 5- 15. Implementations of Low-to-High Memory Stacks 
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5.5.2 Queues 



A queue is like a FIFO. The implementation of queues is based upon the ma- 
nipulation of auxiliary registers. Two auxiliary registers are used, one to mark 
the front of the queue from which data is popped (or dequeued) and the other 
to mark the rear of the queue where data is pushed. By properly managing the 
auxiliary registers, the queue may also be circular. (A queue is circular when 
the rear pointer is allowed to point to the beginning of the queue memory after 
it has pointed to the end of the queue memory.) 
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Chapter 6 



The TMS320C3x provides a complete set of constructs that facilitate software 
and hardware control of the program flow. Software control includes repeats, 
branches, calls, traps, and returns. Hardware control includes operations, re- 
set, and interrupts. Because programming includes a variety of constructs, you 
can select the one suited for your particular application. 

Several interlocked operations instructions provide flexible multiprocessor 
support and, through the use of external signals, a powerful means of 
synchronization. They also guarantee the integrity of the communication and 
result in a high-speed operation. 

The TMS320C3x supports a nonmaskable external reset signal and a number 
of internal and external interrupts. These functions can be programmed for a 
particular application. 

This chapter discusses the following major topics: 

□ Repeat Modes (Section 6.1 on page 6-2) 

n Initialization 

El Operation 
Q Delayed Branches (Section 6.2 on page 6-7) 
Qi Calls, Traps, and Returns (Section 6.3 page 6-8) 
Qi Interlocked Operations (Section 6.4 on page 6-1 0) 
d Reset Operation (Section 6.5 on page 6-1 6) 
Q Interrupts (Section 6.6 on page 6-20) 
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6.1 Repeat Modes 



The repeat modes of the TMS320C3x can implement zero-overhead looping. 
For many algorithms, most execution time is spent in an inner kernel of code. 
Using the repeat modes allows these time-critical sections of code to be ex- 
ecuted in the shortest possible time. 

The TMS320C3x provides two instructions to support zero-overhead looping: 
RPTB (repeat a block of code) and RPTS (repeat a single instruction). RPTB 
causes a block of code to be repeated a specified number of times. RPTS 
causes a single instruction to be repeated a number of times and reduces the 
bus traffic by fetching the instruction only once. 

Three registers (RS, RE, and RC) are associated with the updating of the pro- 
gram counter when it is updated in a repeat mode. Table 6-1 describes these 
registers. 



Table 6-1. Repeat-Mode Registers 



Register 


Function 


RS 


Repeat Start Address Register. Holds the address of the first instruction 
of the block of code to be repeated. 


RE 


Repeat End Address Register. Holds the address of the last instruction 
of the block of code to be repeated. 


RC 


Repeat-Count Register. Contains one less than the number of times 
the block remains to be repeated. For example, to execute a block N 
times, load N-1 into RC. 



6.1.1 Repeat-Mode Initialization 



Two bits are important to the operation of RPTB and RPTS: the RM and S bits. 

The RM (repeat-mode flag) bit in the status register specifies whether the pro- 
cessor is running in the repeat mode. If RM = 0, fetches are not made in repeat 
mode. If RM = 1 , fetches are made in repeat mode. 

The S bit is hidden from the user but is necessary to fully describe the operation 
of RPTB and RPTS. If S = 0, the CPU is not performing fetches in the repeat- 
single mode. If S = 1 and RM = 1 , the CPU is performing fetches in the repeat- 
single mode. 

The correct operation of the repeat modes requires that all of the above regis- 
ters and status register fields be initialized correctly. The RPTB and RPTS in- 
structions perform this initialization in slightly different ways (see Sections 
6.1.2 and 6.1.3). 
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6.1.2 RPTB Initialization 

When RPTB src is executed, the following operations take place: 

1) PC +1 ->RS 

2) src -> RE 

3) 1 -> RM status register bit 

4) -> S bit. 

Step 1 loads the start address of the block into RS. Step 2 loads the src into 
the RE (end address of the block). The srcoperand is a 24-bit value contained 
in the instruction word. Step 3 sets the status register to indicate the repeat 
mode of operation. Step 4 indicates that this is the repeat block mode of opera- 
tion. 

The last bit of information required is the number of times to repeat the block. 
The value is determined by properly initializing the RC (repeat count) register. 
Since the execution of RPTB does not load the RC, you must load this register 
yourself. The typical setup of the block repeat operation is shown below. 

LDI 15, RC ; 15 — > RC, LOOP will be executed 16 times 
RPTB LOOP ; LOOP -^ RE, PC + 1 RS, 1 -> RM, -> S 

The repeat modes repeat a block of code at least once in a typical operation. 
The repeat counter should be loaded with one less than the number of times 
to repeat the block; i.e., a value of in RC repeats the block of code one time. 
All block repeats initiated by RPTB can be interrupted. 

6.1.3 RPTS Initialization 

When RPTS src is executed, the following sequence of operations occurs: 

1) PC + 1->RS 

2) PC + 1 -^ RE 

3) 1 -^> RM status register bit 

4) 1 -> S bit 

5) src -> RC 

The RPTS instruction loads all registers and mode bits necessary for the oper- 
ation of the single instruction repeat mode. Step 1 loads the start address of 
the block into RS. Step 2 loads the end address into the RE (end address of 
the block). Since this is a repeat of a single instruction, the start address and 
the end address are the same. Step 3 sets the status register to indicate the 
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repeat mode of operation. Step 4 indicates that this is the repeat single-instruc- 
tion mode of operation. The operand src is loaded into RC, as in Step 5 and 
the following instruction is executed src+1 times. 

Repeats of a single instruction initiated by RPTS are not interruptible, because 
the RPTS fetches the instruction word only once and then keeps it in the in- 
struction register for reuse. An interrupt would cause the instruction word to be 
lost. The refetching of the instruction word from the instruction register reduces 
memory accesses and, in effect, acts as a one-word program cache. If it is nec- 
essary to have a single instruction that is repeatable and interruptible, you can 
use the RPTB instruction. 



6.1 .4 Repeat-Mode Operation 



The information in the repeat-mode registers and associated control bits is 
used to control the modification of the PC when the fetches are being made 
in repeat mode. The repeat modes compare the contents of the RE register 
with the program counter (PC) . If they match and the repeat counter is nonneg- 
ative, the repeat counter is decremented, the PC is loaded with the repeat start 
address, and the processing continues. The fetches and appropriate status 
bits are modified as necessary. Note that the repeat counter (RC) is never mo- 
dified when RM is 0. The maximum number of repeats occurs. when RC = 
080000000h. This will result in 080000001 h repetitions. The detailed algorithm 
for the update of the PC is described in Figure 6-1 . 



Figure 6-1. Repeat-Mode Control Algorithm 



if RM == 1 
if S == 1 

if first time through 

fetch instruction from memory 
else 

fetch instruction from IR 
RC - 1 -> RC 
if RC < 

-> ST(RM) 
-> S 

PC + 1 -> PC 
else if S == 
fetch instruction from memory 
if PC == RE 

RC - 1 -» RC 
if RC > 

RS -» PC 
else if RC < 

-» ST(RM) 

-> S 

PC + 1 -> PC 



If in repeat mode (RPTB or RPTS) 

If RPTS 

If this is the first fetch 

Fetch instruction from memory 

If not the first fetch 

Fetch instruction from IR 

Decrement RC 

If RC is negative 

Repeat single mode completed 

Turn off repeat mode bit 

Clear S 

Increment PC 

If RPTB 

Fetch instruction from memory 

If this is the end of the block 

Decrement RC 

If RC is not negative 

Set PC to start of block 

If RC is negative 

Turn off repeat mode bits 

Clear S 

Increment PC 
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The RPTB and RPTS are four-cycle instructions. These four cycles of 
overhead are incurred only on the first pass through the loop. All subsequent 
passes through the loop are accomplished with zero cycles of overhead. In 
Example 6-1 , the block of code from STLOOP to ENDLOP is repeated sixteen 
times. 



Example 6-1. RPTB Operation 





LDI 


15, RC 




RPTB 


ENDLOP 


STLOOP 






ENDLOP 







; Load repeat counter with 15 
; Execute the block, of code 

from STLOOP to ENDLOP 16 times 



Using the repeat block mode of modifying the PC. facilitates analysis of what 
would happen in the case of branches within the block. Assume that the next 
value of the PC will be either PC + 1 or the contents of the RS register. It is thus 
apparent that this method of block repeat allows many amount of branching 
within the repeated block. Execution can go anywhere within the user's code 
via interrupts, subroutine calls, etc. For proper modification of the loop counter, 
the last instruction of the loop must be fetched. You can stop the repeating of 
the loop prior to completion by writing a into the repeat counter or writing 
into the RM bit of the status register. 

Since the block repeat modes modify the program counter, other instructions 
cannot modify the program counter at the same time. Two rules apply here: 

1 ) The last instruction in the block (or the only instruction in a block of 
size one) cannot be a Bcond, BR, DBcond, CALL, CALLcond, TRAP- 
cond, RETIcond, RETScond, IDLE, RPTB, or RPTS. Example 6-2 
shows an incorrectly placed standard branch. 

2) None of the last four instructions from the bottom of the block (or the 
only instruction in a block of size one) can be a BcondD, BRD, or 
DBcondD. Example 6-3 shows an incorrectly placed delayed branch. 

If either of these rules is violated, the PC will be undefined. 
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Example 6-2. Incorrectly Placed Standard Branch 



LDI 
RPTB 



STLOOP 



15, RC 
ENDLOP 



Load repeat counter with 15 

Execute block of code 

from STLOOP to ENDLOP 16 times 



ENDLOP BR 



OOPS 



This branch violates rule 1 



Example 6-3. Incorrectly Placed Delayed Branch 



LDI 
RPTB 



STLOOP 



15, RC 
ENDLOP 



Load repeat counter with 15 

Execute block of code 

from STLOOP to ENDLOP 16 times 



BRD OOPS ; This branch violates rule 2 

ADDF 
MPYF 
ENDLOP SUBF 

Block repeats (RPTB) are nestable. Since all of the control is defined by the 
RS, RE, RC, and ST registers, these registers must be saved and stored in or- 
der to nest block repeats. The RM bit in the status register can be used to deter- 
mine if the block repeat mode is active. For example, if you write an interrupt 
service routine that requires the use of RPTB, it is possible that the interrupt 
associated with the routine may occur during another block repeat. The inter- 
rupt service routine can check the RM bit. If this bit is set, the interrupt routine 
saves RS, RE, RC, and ST. The interrupt routine can then perform a block re- 
peat. Before returning to the interrupted routine, the interrupt routine restores 
RS, RE, RC, and ST. If the RM bit is not set, you don't need to save and restore 
these registers. 
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6.2 Delayed Branches 



The TMS320C3x offers two main types of branching: standard and delayed. 
Standard branches empty the pipeline before performing the branch; this guar- 
antees correct management of the program counter and results in a 
TMS320C3x branch taking four cycles. Included in this class are repeats, calls, 
returns, and traps. 

Delayed branches on the TMS320C3x do not empty the pipeline, but rather 
guarantee that the next three instructions will execute before the program 
counter is modified by the branch. The result is a branch that requires only a 
single cycle, thus making the speed of the delayed branch very close to the op- 
timal block repeat modes of the TMS320C3x. However, unlike block repeat 
modes, delayed branches may be used in situations other than looping. Every 
delayed branch has a standard branch counterpart that is used when a 
delayed branch cannot be used. The delayed branches of the TMS320C3x are 
BcondD, BRD, and DBcondD. 

Conditional delayed branches use the conditions that exist at the end of the 
instruction immediately preceding the delayed branch. They do not depend 
upon the instructions following the delayed branch. The condition flags are set 
by a previous instruction only when the destination register is one of the exten- 
ded-precision registers (R0-R7) or when one of the compare instructions 
(CMPF, CMPF3, CMPI, CMPI3, TSTB, or TSTB3) is executed. Delayed 
branches guarantee that the next three instructions will execute, regardless 
of other pipeline conflicts. 

When a delayed branch is fetched, it remains pending until the three following 
instructions are executed. None of the three instructions that follow a delayed 
branch can be Bcond, BcondD, BR, BRD, DBcond, DBcondD, CALL, CALL- 
cond, TRAPcond, RET\cond, REJScond, RPTB, RPTS, or IDLE, (see 
Example 6-4). 

Delayed branches disable interrupts until the three instructions following the 
delayed branch are completed. This is independent of whether or not the 
branch is taken. 

If delayed branches are used incorrectly, the PC will be undefined. 



Example 6-4. incorrectly Placed Delayed Branches 



Bl 



B2 



BD 

NOP 

NOP 

B 

NOP 

NOP 

NOP 



LI 



L2 



This branch is incorrectly placed 



6-7 



Calls, Traps, and Returns 



6.3 Calls, Traps, and Returns 

Calls and traps provide a means of executing a subroutine or function while 
providing a return to the calling routine. 

The CALL, CALLcond, and TRAPcond instructions store the value of the PC 
on the stack before changing the PC's contents. The stack thus provides a 
return using either the RETScond or RETIcond instruction. 

Ql The CALL instruction places the next PC value on the stack and places 
the src (source) operand into the PC. The src is a 24-bit immediate value. 
Figure 6-2 shows CALL response timing. 

Ql The CALLconof instruction is similar to the CALL instruction (above) except 
that (1) it executes only if a specific condition is true (the 20 condi- 
tions — including unconditional — are listed in Section 10.2) and (2) the 
src is either a PC-relative displacement or in register addressing mode. 
The condition flags are set by a previous instruction only when the destina- 
tion register is one of the extended-precision registers (R0-R7) or when 
one of the compare instructions (CMPF, CMPF3, CMPI, CMPI3, TSTB, or 
TSTB3) is executed. 

Ql The TRAPcond instruction also executes only if a specific condition is true 
(same conditions as for the CALLcond instruction). When executing, (1) 
interrupts are disabled with written to bit GIE of the ST, (2) the next PC 
value is stored on the stack, and (3) a vector is retrieved from one of the 
addresses 20h to 3Fh and loaded into the PC. The particular address is 
identified by a trap number in the instruction. Using the RETIcondto return 
re-enables interrupts. 

□i RETScond returns execution from any of the above three instructions by 
popping the top of the stack to the PC. To execute, the specified condition 
must be true. Conditions are the same as for the CALLcond instruction. 

Q RETIcond returns from traps or calls similar to the RETScond (above) with 
the addition that RETIcond also sets the GIE bit of the Status Register 
which thus enables all interrupts whose enabling bit is set to 1 . Conditions 
are the same as for the CALLcond instruction. 

Functionally, calls and traps accomplish the same task (i.e., a subfunction is 
called, executed, and control then returned to the calling function. Traps offer 
several advantages: 

1 ) Interrupts are automatically disabled when a trap is executed. This allows 
critical code to execute without risk of being interrupted. Thus, traps are 
generally terminated with a RETIcond instruction to re-enable interrupts. 

2) You can use traps to indirectly call functions. This is particularly beneficial 
when a kernel of code contains the basic subfunctions to be used by appli- 
cations. In this case, the functions in the kernel can be modified and relo- 
cated without recompiling each application. 
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Calls, Traps, Returns 



Figure 6-2. CALL Response Timing 
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6.4 Interlocked Operations 



One of the most common multiprocessing configurations is the sharing of glob- 
al memory by multiple processors. In order for multiple processors to access 
this global memory and share data in a coherent manner, some sort of arbitra- 
tion or handshaking is necessary. This requirement for arbitration is the pur- 
pose of the TMS320C3x interlocked operations. 

The TMS320C3x provides a flexible means of multiprocessor support with five 
instructions, referred to as interlocked operations. Through the use of external 
signals, these instructions provide powerful synchronization mechanisms. 
They also guarantee the integrity of the communication and result in a high- 
speed operation. The interlocked-operation instruction group is listed in 
Table 6-2. 



Table 6-2. Interlocked Operations 



Mnemonic 


Description 


Operation 


LDFI 


Load floating-point value into a register, 
interlocked 


Signal interlocked 
src — > dst 


LDII 


Load integer into a register, interlocked 


Signal interlocked 
src — > dst 


SIGI 


Signal, interlocked 


Signal interlocked 
Clear interlock 


STFI 


Store floating-point value to memory, 
interlocked 


src — > dst 
Clear interlock 


STII 


Store integer to memory, interlocked 


src — > dst 
Clear interlock 



The interlocked operations use the two external flag pins, XFO and XF1 . XFO 
must be configured as an output pin, and XF1 as an input pin. When configured 
in this manner, XFO signals an interlock operation request, and XF1 acts as an 
acknowledge signal for the requested interlocked operation. In this mode, XFO 
and XF1 are treated as active-low signals. 

The external timing for the interlocked loads and stores is the same as for stan- 
dard load and stores. The interlocked loads and stores maybe ext ended like 
stand ard ac cesses by using the appropriate ready signal (RDYj nt or XRDYj nt ). 
(RDYj nt and XRDYj nt are a combination of external ready input and software 
wait states. Refer to Chapter 7, External Bus Operation, for more information 
on ready generation.) 

The LDFI and LDII instructions perform the following actions: 

1 ) Simultaneously set XFO to and begin a read cycle. The timing of XFO 
is similar to that of the address bus during a read cycle. 
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2) Execute an LDF or LDI instru ctio n and e xtend the read cycle until XF1 
is set to and a ready (RDYj nt or XRDYj nt ) is signalled. 

3) Leave XFO set to and end the read cycle. 

The read/write operation is identical to any other read/write cycle except for the 
special use of XFO and XF1 . The srcoperand for LDFI and LDII is always a di- 
rect ojMndre^tjTiejTiory addre^ XFO is setto only if the srcis located off-chip; 
i.e., STRB, MSTRB, or IOSTRB is active, orthe srcis one of the on-chip periph- 
erals. If on-chip memory is accessed, then XFO is not asserted, and the opera- 
tion is as an LDF or LDI from internal memory. 

The STFI and STII instructions perform .the following operations: 

1 ) Simultaneously set XFO to 1 and begin a write cycle. The timing of XFO 
is similar to that of the address bus during a write cycle. 

2) Execu te a n STF o r STI instruction and extend the write cycle until a ready 
(RDY int or XRDY int ) is signaled. 

As in the case for LDFI and LDII, th e dst of STF I and STII affects XFO. If dst 
is located off-chip (STRB, MSTRB, or IOSTRB is active) or the dst is one of 
the on-chip peripherals, XFO is set to a 1 . If on-chip memory is accessed, then 
XFO is not asserted and the operations are as an STF or STI to internal 
memory. 

The SIGI instruction functions as follows: 

1) Sets XFO toO. 

2) Idles until XF1 is set to 0. 

3) Sets XFO to 1 and ends the operation. 

While the LDFI, LDII, and SIGI instructions are waiting for XF1 to b e set to 0, 
you ca n interrupt them. LDFI and LDII require a ready signal (RDYj nt or 
XRDYj n t) in order to be interrupted. Because interrupts are taken on bus cycle 
boundaries (see Section 6.6), an interrupt may be taken any time after a valid 
ready. This allows you to implement protection mechanisms against deadlock 
conditions by interrupting an interlocked load that has taken too long. Upon re- 
turn from the interrupt, the next instruction is executed. The STFI and STII in- 
structions are not interruptible. Since the STFI and STII instructions complete 
when ready is signaled, the delay until an interrupt can occur is the same as 
for any other instruction. 

Interlocked operations can be used to implement a busy-waiting loop, to ma- 
nipulate a multiprocessor counter, to implement a simple semaphore mecha- 
nism, or to perform synchronization between two TMS320C3xs. The following 
examples illustrate the usefulness of the interlocked operations instructions. 
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Example 6-5 shows the implementation of a busy-waiting loop. If location 
LOCK is the interlock for a critical section of code, and a nonzero means the 
lock is busy, the algorithm for a busy-waiting loop can be used as shown. 



Example 6-5. Busy-Waiting Loop 



LDI 1,R0 

LI: LDII @LOCK,Rl 

STII R0,@LOCK 

BNZ LI 



Put 1 in RO 

Interlocked operation begun 

Contents of LOCK -> Rl 

Put RO (= 1) into LOCK, XFO 

Interlocked operation ended 

Keep trying until LOCK = 



Example 6-6 shows how a location COUNT may contain a count of the number 
of times a particular operation needs to be performed. This operation may be 
performed by any processor in the system. If the count is zero, the processor 
waits until it is nonzero before beginning processing. The example also shows 
the algorithm for modifying COUNT correctly. 



Example 6-6. Multiprocessor Counter Manipulation 



CT: OR 4,IOF 

LDII @C0UNT,R1 

BZ CT 

SUBI 1,R1 

STII R1,@C0UNT 



XFO = 1 

Interlocked operation ended 
Interlocked operation begun 
Contents of COUNT -> Rl 
If COUNT = 0, keep trying 
Decrement Rl (= COUNT) 
Update COUNT, XFO = 1 
Interlocked operation ended 



Figure 6-3 illustrates multiple TMS320C3xs sharing global memory and using 
the interlocked instructions as in Example 6-7, Example 6-8, and 
Example 6-9. 
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Figure 6-3. Multiple TMS320C3xs Sharing Global Memory 
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Sometimes it may be necessary for several processors to access some shared 
data or other common resources. The portion of code that must access the 
shared data is called a critical section. 

To ease the programming of critical sections, semaphores may be used. 
Semaphores are variables that can take only non-negative integer values. 
Two primitive, indivisible operations are defined on semaphores (with S being 
a semaphore): 



V(S) 
P(S) 



S + 1 —> s 

if (S == 0) , go to P 

else S - 1 — > S 



Indivisibility of V(S) and P(S) means that when these processes access and 
modify the semaphore S, they are the only processes accessing and modify- 
ing S. 

To enter a critical section, a P operation is performed on a common sema- 
phore, say S (S is initialized to 1 ). The first processor performing P(S) will be 
able to enter its critical section. All other processors are blocked because S 
has become 0. After leaving its critical section, the processor performs a V(S), 
thus allowing another processor to execute P(S) successfully. 
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The TMS320C3x code for V(S) is shown in Example 6-7, and code for P(S) 
is shown in Example 6-8. Compare the code in Example 6-8 to the code in 
Example 6-6. 



Example 6-7. Implementation of V(S) 



V: LDII @S,RO 

ADDI 1,R0 
STII R0,@S 



Example 6-8. Implementation ofP(S) 



OR 4,IOF 

LDII @S,R0 

BZ P 

SUBI 1,R0 

STII R0,@S 



Interlocked read of S begins (XFO = 0) 

Contents of S -4 R0 

Increment R0 (= S) 

Update S, end interlock (XFO = 0) 



End interlock (XFO = 1) 

Interlocked read of S begins 

Contents of S — > R0 

If S = 0, go to P and try again 

Decrement R0 (= S) 

Update S, end interlock (XFO = 1) 



The SIGI operation may be used to synchronize, at an instruction level, multi- 
ple TMS320C3xs. Consider two processors connected as shown in 
Figure 6-4. The code for the two processors is shown in Example 6-9. 



Figure 6-4. Zero-Logic Interconnect of TMS320C3xs 



TMS320C3X #1 




TMS320C3X #2 


XFO 
XF1 




XF1 
XFO 







Processor #1 runs until it executes the SIGI. It then waits until processor #2 
executes a SIGI. At this point, the two processors have synchronized and con- 
tinue execution. 
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Example 6-9. Code to Synchronize Two TMS320C3xs at the Software Level 



Time Code for TMS320C3x #1 
O o 

© 
o 

SIGI 



V 

(WAIT) 



V. 

O 
O 

o 

© 



CodeforTMS320C3x#2 
o 
o 
o 
© 
o 
o 
o 
o 
o 



synchronization Occurs- 



o 

SIGI 
o 
© 
© 

© 
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6.5 Reset Operation 



The TMS320C3x supports a nonmaskable external reset signal (RESET), 
which is used to perform system reset. This section discusses the reset opera- 
tion. 

At powerup , the state of the TMS320C3x processor is undefined. You can use 
the RESET signal to place the processor in a known state. This signal must 
be asserted low for 1 or more H1 clock cycles to guarantee a system reset. 
H1 is an output clock signal generated by the TMS320C3x (see Chapter 1 3 for 
more information). 

Reset affects the other pins on the device in either a synchronous or asynchro- 
nous manner. The synchronous reset is gated by the TMS320C3x's internal 
clocks. The asynchronous reset directly affects the pins, and is faster than the 
synchro nous reset.Table 6-3 shows the state of the TMS320C3x's pins after 
RESET = 0. Each pin is described according to whether the pin is reset syn- 
chronously or asynchronously. 



Table 6-3. Pin Operation at Reset 



Signal 


#Pins 


Operation at Reset 


Primary Interface (61 Pins) 


D31 —DO 


32 


Synchronous reset. Placed in high-impedance state. 


A23 — AO 


24 


Synchronous reset. Placed in high-impedance state. 


R/W 


1 


Synchronous reset. Placed in high-impedance state. 


STRB 


1 


Synchronous reset. Deasserted by going to a high level. 


RDY 


1 


Reset has no effect. 


HOLD 


1 


Reset has no effect. 


HOLDA 


1 


Reset has no effect. 


Expansion Interface (49 Pins)t 


XD31— XDO 


32 


Synchronous reset. Placed in high-impedance state. 


XA12 — XAO 


13 


Synchronous reset. Placed in high-impedance state. 


XRA/V 


1 


Synchronous reset. Placed in high-impedance state. 


MSTRB 


1 


Synchronous reset. Deasserted by going to a high level. 


IOSTRB 


1 


Synchronous reset. Deasserted by going to a high level. 


XRDY 


1 


Reset has no effect. 


Control Signals (9 Pins) 


RESET 


1 


Reset input pin 


TnT3— Into 


4 


Reset has no effect. 


IACK 


1 


Synchronous reset. Deasserted by going to a high level. 


MC/MP or 
MCBL/MP 


1 


Reset has no effect. 


XFT — XFO 


2 


Asynchronous reset. Placed in high-impedance state. 



t Present only on TMS320C30 
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Table 6-3. Pin Operation at Reset (Continued) 



Signal 


#Pins 


Operation at Reset 


Serial Port Signals (6 Pins) 


CLKXO 




Asynchronous reset. Placed in high-impedance state. 


DXO 




Asynchronous reset. Placed in high-impedance state. 


FSXO 




Asynchronous reset. Placed in high-impedance state. 


CLKRO 




Asynchronous reset. Placed in high-impedance state. 


DRO 




Asynchronous reset. Placed in high-impedance state. 


FSRO 




Asynchronous reset. Placed in high-impedance state. 


Serial Port 1 Signals (6 Pins) t 


CLKX1 




Asynchronous reset. Placed in high-impedance state. 


DX1 




Asynchronous reset. Placed in high-impedance state. 


FSX1 




Asynchronous reset. Placed in high-impedance state. 


CLKR1 




Asynchronous reset. Placed in high-impedance state. 


DR1 




Asynchronous reset. Placed in high-impedance state. 


FSR1 




Asynchronous reset. Placed in high-impedance state. 


Timer Signal (1 Pin) 


TCLKO 




Asynchronous reset. Placed in high-impedance state. 


Timer 1 Signal (1 Pin) 


TCLK1 




Asynchronous reset. Placed in high-impedance state. 


Supply and Oscillator Signals (29 Pins) 


V DD (3-0) 


4 


Reset has no effect. 


IODV DD (1,0) 


2 


Reset has no effect. 


ADVddO.O) 


2 


Reset has no effect. 


pdv dd 


1 


Reset has no effect. 


DDV DD (1,0) 


2 


Reset has no effect. 


MDV DD 


1 


Reset has no effect. 


VssO — 0) 


4 


Reset has no effect. 


DV S s(3-0) 


2 


Reset has no effect. 


CVssO.O) 


2 


Reset has no effect. 


ivss 




Reset has no effect. 


vbbp 




Reset has no effect. 


SUBS 




Reset has no effect. 


X1 




Reset has no effect. 


X2/CLKIN 




Reset has no effect. 


H1 




Synchronous reset. Will go to its initial state when RESET 
makes a 1 to transition. See Appendix A. 


H3 


1 


Synchronous reset. Will go to its initial state when RESET 
makes a 1 to transition. See Appendix A. 



t Present only on TMS320C30 
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Table 6-3. Pin Operation at Reset (Continued) 



Signal 


#Pins 


Operation at Reset 


Emulation, Test, and Reserved (18 Pins) 


EMUO 




Undefined. 


EMU1 




Undefined. 


EMU2 




Undefined. 


EMU3 




Undefined. 


EMU4/SHZ 




Undefined. 


EMU5t 




Undefined. 


EMU6t 




Undefined. 


RSVot 




Undefined. 


RSVlt 




Undefined. 


RSV2t 




Undefined. 


RSV3t 




Undefined. 


RSV4t 




Undefined. 


RSV5t . 




Undefined. 


RSV6t 




Undefined. 


RSV7t 




Undefined. 


RSV8t 




Undefined. 


RSV9t 




Undefined. 


RSVIOt 




Undefined. 



t Present only on TMS320C30 
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At system reset, the following additional operations are performed: 

Qi The peripherals are reset. This is a synchronous operation. The peripheral 
reset is described in Chapter 8. 

□i The following CPU registers are loaded with zero: 
m ST (CPU status register) 
H IE (CPU/DMA interrupt enable flags) 
E IF (CPU interrupt flags) 
H IOF (I/O flags) 

Qi The reset vector is read from memory location Oh and loaded into the PC. 
This vector contains the start address of the system reset routine. 

Qi Execution begins. Refer to Section 11.1 for an example of a processor 
initialization routine. 

Multiple TMS320C3xs driven by the sa me syste m clock may be reset and syn- 
chronized. When the 1 to transition of RESET occurs, the processor is placed 
on a well-defined internal phase, and all of the TMS320C3xs will come up on 
the same internal phase. 
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6.6 Interrupts 



The TMS320C3x supports multiple internal and external interrupts, which can 
be used for a variety of applications. This section discusses the operation of 
these interrupts. 

A functional diagram of the logic used to implement the external interrupt inputs 
is shown in Figure 6-5; the logic for internal interrupts is similar. Additional in- 
formation regarding internal interrupts can be found in Chapter 8. 



Figure 6-5. Interrupt Logic Functional Diagram 



INTn 



D Q 
CLK 



H1 



H3 
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CLK 
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EINTn(CPU) 
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H1 
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GIE(CPU) 






to- 



Internal 

Interrupt 

Processor 



To 
-► Control 
Section 



•- GIE(DMA) 
EINTn(DMA) 



External interrupts are synchronized internally, as illustrated by the three flip- 
flops clocked by H 1 and H3. Once synchronized, the interrupt input will set the 
corresponding interrupt flag register (IF) bit if the interrupt is active. 

External interrupts are latched internally on the falling edge of H1 (see the data 
sheet for timing information). An external interrupt must be held low for at least 
one H1/H3 cycle to be recognized by the TMS320C3x. Interrupts should be 
held low for only one or two H1 falling edges. If the interrupt is held low for three 
or more H1 falling edges, multiple interrupts may be recognized. 



6.6.1 Interrupt Control Bits 



When a particular interrupt is processed by the CPU or DMA controller, the cor- 
responding interrupt flag bit is cleared by the in ternal interrupt acknowledge 
signal. It should be noted, however, that if INTn is still low when the interrupt 
acknowledge signal occurs, the interr upt flag bit will be cleared only for one 
cycle and then set again because INTn is still low. Accordingly, it is theoretically 
possible that , dep ending on when the IF register is read, this bit may be zero 
even though INTn is zero. When the TMS320C3x is reset, zero is written to the 
interrupt flag register, thereby clearing all pending interrupts. 
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The interrupt flag register bits may be read and written under software control. 
Writing a 1 to an IF register bit sets the associated interrupt flag to 1 . Similarly, 
writing a resets the corresponding interrupt flag to 0. In this way, all interrupts 
may be triggered and/or cleared through software. Since the interrupts flags 
may be read, the interrupt pins may be polled in software when an interrupt-dri- 
ven interface is not required. 

Internal interrupts operate in a similar manner. In the IF register, the bit corre- 
sponding to an internal interrupt may be read and written through software. 
Writing a 1 sets the interrupt latch, and writing a clears it. All internal interrupts 
are one H1/H3 cycle in length. 

The CPU global interrupt enable bit (GIE), located in the CPU status register 
(ST), controls all CPU interrupts. All DMA interrupts are controlled by the DMA 
global interrupt enable bit, which is not dependent upon ST(GIE) and is local 
to the DMA. The DMA global interrupt enable bit is dependent, in part, upon 
the state of the DMA SYNCH bits. It is not directly accessible through software 
(see Chapter 8). The AND of the interrupt flag bit and the interrupt enables is 
then connected to the interrupt processor. 

To provide for maximum performance in servicing interrupts, the interrupt ac- 
knowledge (IACK) instruction is provided. IACK drives the IACK pin and per- 
forms a dummy read. The read is performed from the address specified by the 
IACK instruction operand. When IACK is used, it typically is placed in the early 
portion of an interrupt service routine. For certain applications, it may be better 
suited at the end of the interrupt service routine or be totally unnecessary. 



6.6.2 TMS320C3x Interrupt Considerations 



Give careful consideration to TMS320C3x interrupts, especially if user modifi- 
cations are made to the status register when the global interrupt enable (GIE) 
bit is set. This can result in the GIE bit being erroneously set or reset as de- 
scribed in the following paragraphs. 

The GIE bit is set to (zero) by an interrupt. This may cause a processing error 
if any code following within two cycles of the interrupt recognition attempts to 
read or modify the status register. For example, if the status register is being 
pushed onto the stack, it will be stored incorrectly if an interrupt was acknowl- 
edged two cycles before the store instruction. 

When an interrupt signal is recognized, the TMS320C3x continues executing 
the instructions already in the read and decode phases in the pipeline. Howev- 
er, because the interrupt is acknowledged, the GIE bit is reset to 0, and the 
store instruction already in the pipeline will store the wrong status register val- 
ue. For example, if the program is something like: 
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NOP 

interrupt recognized — > LDI @V_ADDR, AR1 

MPYI *AR1, RO 

PUSH ST 

POP ST 

the PUSH ST instruction will save the ST contents in memory, which includes 
GIE = 0. Since the programmer expects GIE = 1 , the POP ST instruction will 
put the wrong status register value into the ST 

A similar situation may occur if the GIE bit=1 and an instruction executes that 
intends to modify the other status bits and leave the GIE bit set. In the above 
example, this erroneous setting would occur if the interrupt is recognized two 
cycles before the POP ST instruction. In that case, the interrupt would clear 
the GIE bit, butthe execution of the POP instruction would setthe GIE bit. Since 
the interrupt has been recognized, the interrupt service routine will be entered 
with interrupts enabled, rather than disabled as expected. 

One solution is to make use of traps. For example, you can use TRAP to reset 
GIE and use TRAP 1 to set GIE. This is accomplished by making TRAP and 
TRAP 1 be the instructions RETS and RETI, respectively. 

Another alternative incorporates the following code fragment, which protects 
from modifying or saving of the status register by disabling interrupts through 
the interrupt enable register: 

Save IE register • Added instructions to 

Clear IE register avoid pipeline problems. 

• 2 NOPs or useful instructions 

set gie = • Instruction that reads or 

writes to ST register. 
Added instruction 
to avoid pipeline 
problems. 

6.6.3 TMS320C30 Interrupt Considerations 

The TMS320C30 has two additional exceptions to the interrupt operation. 

1) The status register global interrupt enable (GIE) bit may be erroneously 
reset to (disabled setting) if all of the following conditions are true: 

Qi a conditional trap instruction (TRAPcond) has been fetched, 

Of the condition for the trap is false, and 

Q a pipeline conflict has occurred, resulting in a delay in the decode or 
read phases of the instruction. 

During the decode phase of a conditional trap, interrupts are temporarily 
disabled to guarantee that the trap will execute prior to a subsequent inter- 
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PUSH 


IE 


; 


LDI- 


0, IE 


; 


NOP 




; 


NOP 




; 


AND 


ODFFFh, 


ST 


POP 


IE 
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rupt. If a pipeline conflict occurs, causing a delay in execution of the condi- 
tional trap, the interrupt disabled condition may become the last known 
condition of the GIE bit. In the case that the trap condition is false, inter- 
rupts will be permanently disabled until the GIE bit is intentionally set. The 
condition does not present itself when the trap condition is true, because 
normal operation of the instruction causes the GIE to be reset, and stan- 
dard coding practice will set the GIE to a one before the trap routine is ex- 
ited. Several instruction sequences that may cause pipeline conflicts have 
been found: 



a) 




LDI 


mem, SP 






TRAPcond 


n 


b) 




LDI 
NOP 


mem, SP 






TRAPcond 


n 


c) 




STI 


SP,mem 






TRAPcond 


n 


d) 




STI 


R:-:, *ARy 






LDI 


*AR:-:,Ry 




I I 


LDI 


*ARz,Rw 






TRAPcond 


n 



Other similar conditions may also cause a delay in the execution. There- 
fore, the following solution is recommended to avoid or rectify the problem. 

Insert two NOP instructions immediately prior to the TRAPcond instruc- 
tion. One NOP is insufficient in some cases, as illustrated in case 2 above. 
This eliminates opportunity for any pipeline conflicts in the immediately 
preceding instructions and enables the conditional trap instruction to ex- 
ecute without delays. 

2) Asynchronous accesses to the interrupt flag register (IF) may cause the 
TMS320C3x to fail to recognize and service an interrupt. This may occur 
when an interrupt is generated and is ready to be latched into the IF regis- 
ter on the same cycle that the IF is being written to by the CPU. 

The logic currently gives the CPU write priority ; consequently, the asserted 
interrupt may be lost. This is particularly true if the asserted interrupt has 
been generated internally, such as a DMA interrupt. This situation may 
arise as a result of a decision to poll certain interrupts or a desire to clear 
pending interrupts due to a long pulse width. For the case of the long pulse 
width, the interrupt may be generated after the CPU responds to the inter- 
rupt and attempts to automatically clear it by the interrupt vector process. 

The recommended solution is not to use the interrupt polling technique but 
to design the external interrupt inputs to have pulse widths between 1 and 
2 instruction cycles in length. The alternative to strict polling is to periodi- 
cally enable and disable the interrupts that would be polled, thereby allow- 
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ing the normal interrupt vectoring to take place; that automatically clears 
the interrupt flag without affecting other interrupts. In the event there is a 
need to clear a pending interrupt, it is recommended that a memory loca- 
tion be used to indicate that the interrupt is invalid. Then the interrupt ser- 
vice routine can read that location, clear it (if the pending interrupt is inval- 
id), and return immediately. The following code fragments show how a 
dummy interrupt due to a long interrupt pulse might be handled: 



ISR_n: 


PUSH 


ST 




PUSH 


DP 




PUSH 


RO 




LDI 


0, DP 




LDI 


0DUMMY INT, RO 




BNN 


ISR n START 




STI 


DP, @ DUMMY INT 




POP 


RO 




POP 


DP 




POP 


ST 




RET I 




ISR_n_START 


: • 






LDI 


INT Fn, RO 




AND 


IF, RO 




BZ 


ISR n END 




LDI 


, DP 




LDI 


OFFFFh, RO 




STI 


RO, @DUMMY_INT 


ISR_n_END: 








POP 


RO 




POP 


DP 




POP 


ST 




RET I 





Save registers 

Clear Data Page Pointer 

If DUMMY_INT is or positive, 

go to ISR_n_START 

Set DUMMY INT = 



Housekeeping, return from interrupt 



normal interrupt service routine 
code goes here 

If ones in IF reg match 

INT_Fn, exit ISR 

Otherwise clear 

DP and set 

DUMMY INT negative & e:-:it 



Exit ISR 
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6.6.4 Prioritization and Control 



The CPU controls all prioritization of interrupts (see Table 6-4 for reset and in- 
terrupt vector locations and priorities). If the DMA is not using interrupts for syn- 
chronization of transfers, it will not be affected by the processing of the CPU 
interrupts. If the CPU is involved in a pipeline conflict (branch, register, or 
memory), it will not respond to the interrupts until that conflict is resolved. It is 
therefore possible to interrupt the CPU and DMA simultaneously with the same 
or different interrupts and, in effect, synchronize their activities. For example, 
it may be necessary to cause a high-priority DMA transfer that avoids bus con- 
flicts with the CPU, i.e., make the DMA higher priority than the CPU. This may 
be accomplished by using an interrupt that causes the CPU to trap to an inter- 
rupt routine that contains an IDLE instruction. Then if the same interrupt is used 
to synchronize DMA transfers, the DMA transfer counter can be used to gener- 
ate an interrupt and, thus to return control to the CPU following the DMA trans- 
fer. 

Since the DMA and CPU share the same set of interrupt flags, the DMA may 
clear an interruptflag before the CPU can respond to it. For example, if the CPU 
interrupts are disabled, the DMA can respond to interrupts and thus clear the 
associated interrupt flags. 



Table 6-4. Reset and Interrupt Vector Locations 



Reset or 
Interrupt 


Vector 
Location 


Priority 


Function 


RESET 


Oh 





External reset signal input on the RESET 
pin. 


INTO 


1h 


1 


External interrupt input on the INTO pin. 


INTT 


2h 


2 


External interrupt input on the INT1 pin. 


INT2 


3h 


3 


External interrupt input on the INT2 pin. 


INT3 


4h 


4 


External interrupt input on the INT3 pin. 


XINTO 


5h 


5 


Internal interrupt generated when serial-port 
transmit buffer is empty. 


RINTO 


6h 


6 


Internal interrupt generated when serial-port 
receive buffer is full. 


XINT1 t 


7h 


7 


Internal interrupt generated when serial-port 
1 transmit buffer is empty. 


RINT1 t 


8h 


8 


Internal interrupt generated when serial-port 
1 receive buffer is full. 


TINTO 


9h 


9 


Internal interrupt generated by timer 0. 


TINT1 


OAh 


10 


Internal interrupt generated by timer 1 . 


DINT 


OBh 


11 


Internal interrupt generated by DMA control- 
ler 0. 



t Reserved on TMS320C31 
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If there is a delayed branch in the pipeline, interrupts are held pending until after 
the branch. If the interrupt occurs in the first cycle of the fetch of an instruction, 
the fetched instruction is discarded (not executed), and the address of that in- 
struction is pushed to the top of the system stack. If the interrupt occurs after 
the first cycle of the fetch, in the case of a multicycle fetch due to wait states, 
that instruction is executed and the address of the next instruction to be fetched 
is pushed to the top of the system stack. If no program fetch is occurring, then 
no new fetch is performed. After the address of the appropriate instruction has 
been pushed, the interrupt vector is fetched and loaded into the PC, and ex- 
ecution continues. 

The TMS320C3x allows the CPU and DMA to respond to and process inter- 
rupts in parallel. Figure 6-6 shows interrupt processing flow. The interrupts are 
polled and the CPU and DMA begin processing them. In the interrupt flow per- 
taining to the CPU, the interrupt flag corresponding to the highest-priority en- 
abled interrupt is cleared, and GIE is set to 0. The CPU completes all fetched 
instructions. The interrupt vector is fetched and loaded into the PC, and the 
CPU continues execution. The DMA cycle is similar to that for the CPU. After 
the pertinent interrupt flag is cleared, the DMA proceeds according to the sta- 
tus of the SYNCH bits in the DMA global control register. 
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Figure 6-6. Interrupt Processing 
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Chapter 7 

External Bus Operation 



Memories and external peripheral devices can be accessed with two external 
interfaces on the TMS320C30: the primary bus and the expansion bus. On the 
TMS320C31 , one bus, the primary bus, is available to access external memo- 
ries and peripheral devices. Wait-state generation, permitting access to slower 
memories and peripherals, can be controlled by manipulating memory- 
mapped control registers associated with the interfaces and by an external in- 
put signal. 

Major topics discussed in this hardware interface section are listed below. 

Q External Interface Control Registers (Section 7.1 on page 7-2) 

■ Primary bus 

■ Expansion bus 

□i External Interface Timing (Section 7.2 on page 7-5) 
Q Programmable Wait States (Section 7.3 on page 7-27) 
Qi Programmable Bank Switching (Section 7.4 on page 7-29) 
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7.1 External Interface Control Registers 



The TMS320C30 provides two external interfaces: the primary bus and the ex- 
pansion bus. The TMS320C31 provides one external interface: the primary 
bus. The primary bus consists of a 32-bit data bus, a 24-bit address bus, and 
a set of control signals. The expansion bus consists of a 32-bit data bus, a 
1 3-bit address bus, and a set of control signals. Both buses support software- 
controlled wait states and an external ready input signal, and both buses are 
useful for data, program, and I/O accesses. 



Access is determined by an active strobe signa l (STRB, MSTRB, IOSTRB). 
When a primary bus access is performed, STRB is low. The expansion bus of 
theTMS320C30 supports two types of accesses: 



1 ) Memory access signalled by MST RB low. The timing for a MSTRB access 
is the same as that of the STRB access on the primary bus. 



2) External peripheral device access is signaled by IOSTRB low. 

The primary bus and the expansion bus each have an associated control regis- 
ter. These registers are memory-mapped as shown in Figure 7-1 . 



Figure 7-1. Memory-Mapped External Interface Control Registers 

Register 



Expansion Bus Control (See subsection 7.1 .2)t 



Reserved 



Reserved 



Reserved 



Primary Bus Control (See subsection 7.1.1) 



Reserved 



Reserved 



Reserved 



Reserved 



Reserved 



Reserved 



Reserved 



Reserved 



Reserved 



Reserved 



Reserved 



Peripheral 
Address 

808060h 

808061 h 

808062h 

808063h 

808064h 

808065h 

808066h 

808067h 

808068h 

808069h 

80806Ah 

80806Bh 

80806Ch 

80806Dh 

80806Eh 

80806Fh 



t Reserved on the TMS320C31 
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7.1.1 Primary-Bus Control Register 



The primary bus control register is a 32-bit register that contains the control bits 
for the primary bus (see Figure 7-2). Table 7-1 lists the register bits with the 
bit names and functions. 



Figure 7-2. Primary-Bus Control Register 
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NOTE: 



xx = reserved bit, read as 0. 
R = read, W = write. 



Table 7-1. Primary-Bus Control Register Bits Summary 



Bit 


Name 


Reset 
Value 


Function 





HOLDST 


xt 


Hold status bit. This bit signals whether the port is being held 
(HOLDST = 1) or is not being held (HOLDST = 0). This status bit is 
valid whether the port has been held via hardware or software. 


1 


NOHOLD 





Port hold signal. NOHOLD allows or disallows the port to be held by 
an external HOLD signal. When NOHOLD = 1 , the TMS320C3x takes 
overthe external bus and controls it, regardless of serviced orpending 
requests by external devices. No hold acknowledge (HOLDA) is as- 
serted when a HOLD is received. However, it is asserted if an internal 
hold is generated (HIZ = 1). NOHOLD is set to at reset. 


2 


HIZ 





Internal hold. When set (HIZ = 1), the port is put in hold mode. This 
is equivalent to the external HOLD signal. By forcing a high-impe- 
dance condition, the TMS320C3x can relinquish the external memory 
port through software. HOLDA goes low when the port is placed in the 
high-impedance state. HIZ is set to at reset. 


4 — 3 


SWW 


11 


Software wait mode. In conjunction with WTCNT, this 2-bit field de- 
fines the mode of wait-state generation. It is set to 1 1 at reset. 


7 — 5 


WTCNT 


111 


Software wait mode. This 3-bit field specifies the number of cycles to 
use when in software wait mode for the generation of internal wait 
states. The range is zero (WTCNT = 0) to seven (WTCNT = 111) 
H1/H3 cycles. It is set to 1 1 1 at reset. 


12 — 8 


BNKCMP 


10000 


Bank compare. This 5-bit field specifies the number of MSBs of the 
address to be used to define the bank size. It is set to 1 at reset. 


31 —13 


Reserved 


0-0 


Read as 0. 



t x = or 1 
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7.1.2 Expansion-Bus Control Register 



The expansion bus control register is a 32-bit register that contains control bits 
for the expansion bus (see Figure 7-3 and Table 7-2). 



Figure 7-3. Expansion-Bus Control Register 
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NOTE: xx = reserved bit, read as 0. 
R = read, W = write. 



Table 7-2. Expansion-Bus Control Register Bits Summary 



Bit 


Name 


Reset 
Value 


Function 


2— 


Reserved 


000 


Read as 0. 


4—3 


SWW 


11 


Software wait-state generation. In conjunction with the WTCNT, 
this 2-bit field defines the mode of wait-state generation. It is set 
to 1 1 at reset. 


7 — 5 


WTCNT 


111 


Software wait mode. This 3-bit field specifies the number of cycles 
to use when in software wait mode for the generation of internal 
wait states. The range is zero (WTCNT. = 0) to seven 
( WTCNT= 111) H1/H3 clock cycles. It is set to 1 1 1 at reset. 


31—8 


Reserved 


0-0 


Read as 0. 
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7.2 External Interface Timing 



This section discusses functional timing of operations on the primary bus and 
the expansion bus, the TMS320C3x's two independent parallel buses. De- 
tailed timing specifications for all TMS320C3x signals are contained in Chap- 
ter 1 3, TMS320C3x Signal Descriptions and Electrical Characteristics. 

The parallel buses implement three mutually exclusive a ddress spaces d istin- 
guished t hrou gh the u se of three separate control signals: STRB, MSTRB, and 
IQSTRB . Th e STRB signal controls accesses on the primary bus, and the 
MSTRB and IOSTRB control accesses on the expansion bus. Since the two 
buses are independent, two accesses may be made in parallel. 

With the exception of bank switching and the external HOL D functio n (dis- 
cussed later in this section), timing of primary bus cycles and MSTRB expan- 
sion bus cycles are identical and are discussed c ollectiv ely. The acr onym 
(M)STRB is_used in references tha t perta in equally to STRB and MSTRB. Sim- 
ilarly, (X)R/W, (X)A, (X)D, and (X)RDY ar e used to symbolize the equivalent 
primary and expansion bus signals. The IOSTRB expansion bus cycles are 
timed differently and are discussed independently. 



7.2.1 Primary-Bus Cycles 



All bus cycles comprise integral numbers of H1 clock cycles. One H1 cycle is 
defined to be from one falling edge of H 1 to the next falling edge of H 1 . For full- 
speed (zero wait-state) accesses, writes take two H1 cycles and reads take 
one cycle; however, if the read follows a write, t he read takes two cycles.This 
applies to both the primary bus and the MSTRB expansion bus access. Recall 
that, internally (from the perspective of the CPU and DMA), writes require only 
one cycle if no accesses to that interface are in progress. The following discus- 
sions pertain to zero wait-state accesses unless otherwise specified. 



The (M)STRB signal is low for the active portion of both reads and writes, which 
lasts one H1 cycle. Additionally, before and after the active por- 
tion — (M)STRB low — of writes only, there is a transition cycle of H1 . During 
this transition cycle, the following occur: 



1) (M)STRB is high. 

2) If required, (X)R/W changes state on H1 rising. 

3) If required, address changes on H1 rising if the previous H1 cycle was the 
• active portion of a write. If the previous H1 cycle was a read, address 

changes on H1 falling. 
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Figure 7-4 illustrates a read-read-write sequence for (M)STRB active and no 
wait states. The data is read as late in the cycle as possible to allow for the max- 
imum access time from address valid. Note that although external writes take 
two cycles, internally (from the perspective of the CPU and DMA), they require 
only one cycle if no accesses to that interfaceare in progress. In the typical tim- 
i ng for al l e xternal i nterfaces, the (X)R/W strobe does not change until 
(M)STRB or IOSTRB goes inactive. 



Figure 7-4. Read-Read-Write for (M)STRB = 
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Figure 7-5 illustrates a write-write-read sequence for (M)STRB active and no 
wait states. The addre ss and data written are held valid approximately one-half 
cycle after (M)STRB changes. 



Figure 7-5. Write-Write-Read for (M)STRB = 




(X)RDY 
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Figure 7-6 illustrates a r ead cycle with one wait state. Since (X)RDY = 1 , the 
read cycle is extende d. (M)ST RB, (X)R/W, and (X)A are also extended one 
cycle. The next time (X)RDY is sampled, it is 0. 



Figure 7-6. Use of Wait States for Read for (M)STRB = 
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Figure 7-7 illustrates a write cy cle with o ne waitstate. Since initially (X)RDY = 
1 , the write cycle is exte nded . (M)STRB, (X)R/W, and (X)A are extended one 
cycle. The next time (X)RDY is sampled, it is 0. 



Figure 7-7. Use of Wait States for Write for (M)STRB = 
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7.2.2 Expansion-Bus I/O Cycles 



In contrast to primary bus and MSTRB cycles, IOSTRB reads and writes are 
both two cycles in duration (with no wait states) and exhibit the same timing. 
During th ese cycles, address always changes on the falling edge of H1 , and 
IOSTRB is low from the r ising edg e of the first H1 cycle to the rising edge of 
the second H1 cycle. The IOSTRB signal always goes inactive (high) between 
cycles, and XR/W is high for reads and low for writes. 



Figure 7-8 illustrates read and write cycles when IOSTRB is active and there 
are no wait states. For IOSTRB accesses, reads and writes require a minimum 
of two cycles. Some off-chip peripherals may change their status bits when 
read or written. Therefore, it is important that valid addresses be maintained 
when co mmunica ting with these peripherals. For reads and writes when 
IOSTRB is active, IOSTRB is completely framed by the address. 



Figure 7-8. Read and Write for IOSTRB = 
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Figure 7-9 illustrates a read with one wait state when I OSTRB is active, and 
Figure 7-1 illustrates a write w ith one wait state when IOSTRB is active. For 
each wait state added, IOSTRB, XR/W, and XA are extended one cloc k cycle. 
Writes hold the data on the bus one additional cycle. The sampling of XRDY 
is repeated each cycle. 



Figure 7-9. Read With One Wait State for IOSTRB = 
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Figure 7-10. Write With One Wait-State for IOSTRB = 
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Figure 7-11 through Figure 7-21 illustrate the various transitions between 
memory reads and writes, and I/O writes over the expansion bus. 



Figure 7-11. Memory Read and I/O Write for Expansion Bus 
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Figure 7- 12. Memory Read and I/O Read for Expansion Bus 
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Figure 7- 13. Memory Write and I/O Write for Expansion Bus 
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Figure 7- 14. Memory Write and I/O Read for Expansion Bus 
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Figure 7-15. I/O Write and Memory Write for Expansion Bus 
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Figure 7-16. I/O Write and Memory Read for Expansion Bus 
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Figure 7-17. I/O Read and Memory Write for Expansion Bus 
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Figure 7-18. I/O Read and Memory Read for Expansion Bus 
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Figure 7-19. I/O Write and I/O Read for Expansion Bus 
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Figure 7-20. I/O Write and I/O Write for Expansion Bus 
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Figure 7-21. I/O Read and I/O Read for Expansion Bus 
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Figure 7- 22 and F igur e 7-23 illu strate the signal states when a bus is i nactive 
(after an IOSTRB or (M)STRB access, respectively). The strobes (STRB, 
MSTRB, IOS TRB) a nd (X)R /W) go to 1. The address is undefined, and the 
ready signal (XRDY or RDY) is ignored. 



Figure 7-22. Inactive Bus States for IOSTRB 
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Figure 7-23. Inactive Bus States for STRB and MSTRB 
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Figure 7-24 illustrates the timing for HOLD and HOLDA. HOLD is an external 
asynchronous input. Ther e is a m inimum of one cyc le delay fr om the tim e when 
the processor recognizes HOLD = until HOLDA = 0. When HOLDA = 0, the 
address, data buses, and associated strobes are placed in a high-impedance 
state. All accesses occurring over an interface are complete before a hold is 
acknowledged. 



Figure 7-24. HOLD and HOLDA Timing 
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7.3 Programmable Wait States 



Control wait-state generation by manipulating memory- mapped control regis- 
ters associated with both the primary and expansion interfaces. Use the 
WTCNT field to load an internal timer, and use the SWW field to select one of 
the following four modes of wait-state generation: 



Qi External RDY 



□i WTCNT-generated RDYwtcnt 



Logical-AND of RDY and RDYwtcnt 



Qi Logical-OR of RDY and RDYwtcnt 



The four modes are used to ge nerate the internal ready signal, RDYj nt , that 
controls accesse s. As long as RDYj nt = 1 , the current external access is 
delayed. When RDYj nt = 0, the current access completes. Since the use of 
programmable wait states for both external interfaces is identical, only the pri- 
mary bus interface is described in the following paragraphs. 



RDY wtcnt is an internally generated ready signal. When an external access is 
begun, the value in WTCNT is loaded into a counter. WTCNT may be any value 
from through 7. The counter is decremented every H1/H3 clock cycle until 
it becomes 0. Once the counter is set to 0, i t remains set to until the next ac- 
cess. While the counter is nonzero, RDY wtcnt = 1. While the counter is 0, 
RDY wtcnt = 0- 
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When SWW = 0, RDYj nt is dependent only upon RDY. RDY wtcnt is ignored. 
The truth table for this mode is Table 7-3. 



Table 7-3. Wait-State Generation When SWW= 



RDY 


RDY w tc n t 


RDYjnt 




1 
1 



1 

1 





1 
1 



When SWW = 01, RDY int is dependent only upon RDY wtcnt . RDY is ignored. 
Table 7-4 is the truth table for this mode. 



Table 7-4. Wait-State Generation When SWW= 1 



RDY 


RDY wtcn t 


RDYi nt 





1 • 
1 



1 

1 



1 


1 



When SWW = 1 0, R DY int is t he log ical-OR (electrical-AND, since these 
signals are low true) of RDY and RDY w t cn t (see Table 7-5). 



Table 7-5. Wait-State Generation When SWW= 1 



RDY 


RDYwtcnt 


RDYmt 




1 
1 



1 

1 






1 



When SWW = 11, R DY int is th e log ical-AND (electrical-OR, since these 
signals are low true) of RDY and RDY w t cn t- The truth table for this mode is 
Table 7-6. 



Table 7-6. Wait-State Generation When SWW= 1 i 



RDY 


RDY w tcnt 


RDYjnt 




1 
1 



1 


1 



1 
1 
1 
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7.4 Programmable Bank Switching 



Programmable bank switching allows you to switch between external memory 
banks without externally inserting wait states due to memories that require sev- 
eral cycles to turn off. Bank switching is implemented on the primary bus and 
not on the expansion bus. 

The size of a bank is determined by the number of bits specified to be ex- 
amined. For example (see Figure 7-25), if BNKCMP =1 6, the 1 6 MSBs of the 
address are used to define a bank. Since addresses are 24 bits, the bank size 
is specified by the 8 LSBs, yielding a bank size of 256 words. If BNKCMP > 1 6, 
only the 1 6 MSBs are compared. Bank sizes from 2 8 = 256 to 2 24 = 1 6M are 
allowed. Table 7-7 summarizes the relationship between BNKCMP, the ad- 
dress bits used to define a bank, and the resulting bank size. 



Figure 7-25. BNKCMP Example 
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Table 7-7. BNKCMP and Bank Size 



BNKCMP 


MSBs Defining a Bank 


Bank Size (32-Bit Words) 


00000 


None 


224= 1 6M 


00001 


23 


223= 8M 


00010 


23—22 . 


222= 4M 


00011 


23—21 


221 =2M 


00100 


23—20 


220= 1M 


00101 


23—19 


219=512K 


00110 


23—18 


2"I8=256K 


00111 


23—17 


2 17 =128K 


01000 


23—16 


2"I6=64K 


01001 


23—15 


2 1 5=32K 


01010 


23—14 


2"14=16K 


01011 


23—13 


2"I3=8K 


01100 


23—22 


2 12 =4K 


01101 


23—11 


211 = 2K 


01110 


23—12 


2"I0=1K 


01111 


23—9 


2 9 =512 


10000 


23—8 


28 =256 


10000 —11111 


Reserved 


Undefined 
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The TMS320C3x has an internal register that contains the MSBs (as defined 
by the BNKCMP field) of the last address used for a read or write over the pri- 
mary interface. At reset, the register bits are set to zero. If the MSBs of the ad- 
dress being used forthe current primary interface read do not match those con- 
tained in this internal register, a read cycle is not asserted for one H1/H3 clock 
cycle. During this ex tra clock cycle, the address bus switches over to the new 
address, but STRB is inactive (high). The contents of the internal register are 
replaced with the MSBs being used for the current read of the current address. 
If the MSBs of the address being used for the current read match the bits in 
the register, a normal read cycle takes place. 

If repeated reads are performed from the same memory bank, no extra cycles 
are inserted. When a read is performed from a different memory bank, memory 
conflicts are avoided by the insertion of an extra cycle. This feature can be dis- 
abled by setting BNKCMP to 0. The insertion of the extra cycle occurs only 
when a read is performed. The changing of the MSBs in the internal register 
occurs for all reads and writes over the primary interface. 

Figure 7-26 illustrates the addition of an inactive cycle when switches between 
banks of memory occur. 



Figure 7-26. Bank Switching Example 
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Chapter 8 

Peripherals 



The TMS320C3x features two timers, two serial ports (one on the 
TMS320C31 ), and an on-chip Direct Memory Access (DMA) controller. These 
peripheral modules are controlled through memory-mapped registers located 
on the dedicated peripheral bus. 

The DMA controller is used to perform input/output operations without interfer- 
ing with the operation of the CPU. Therefore, it is possible to interface the 
TMS320C3x to slow external memories and peripherals (A/Ds, serial ports, 
etc.) without reducing the computational throughput of the CPU. The result is 
improved system performance and decreased system cost. 

Major topics discussed in this chapter on peripherals are listed below. 

Ql Timers (Section 8.1 on page 8-2) 

a Registers 

H Pulse generation 

a Operation modes 

□ Serial Ports (Section 8.2 on page 8-12) 

a Registers 

H Operation configurations 

n Timing 

n Examples 
□l DMA Controller (Section 8.3 on page 8-38) 

■ Registers 

■ DMA memory transfer operation 
n Synchronization of DMA channels 
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8.1 Timers 



The TMS320C3x timer modules are general-purpose, 32-bit, timer/event 
counters, with two signaling modes and internal or external clocking (see 
Figure 8-1). The timer modules can be used to signal to the TMS320C3x or 
the external world at specified intervals, or to count external events. With an 
internal clock, the timer can be used to signal an external A/D converter to start 
a conversion, or it can interrupt the TMS320C3x DMA controller to begin a data 
transfer. The timer interrupt is one of the internal interrupts. With an external 
clock, the timer can count external events and interrupt the CPU after a speci- 
fied number of events. Available to each timer is an I/O pin that can be used 
as an input clock to the timer, an output clock signal, or a general-purpose I/O 
pin. 



Figure 8-1. Timer Block Diagram 
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Three memory-mapped registers are used by each timer: 

Qi Global-control register 

Qi Period register 

□i Counter register 

The global-control register determines the operating mode of the timer, 
monitors the timer status, and controls the function of the I/O pin of the timer. 
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The period register specifies the timer's signaling frequency. The counter 
register contains the current value of the incrementing counter. The timer can 
be incremented on the rising edge or the falling edge of the input clock. The 
counter is zeroed and can cause an internal interrupt whenever its value equals 
that in the period register. The pulse generator generates two types of external 
clock signals: pulse or clock. The memory map for the timer modules is shown 
in Figure 8-2. 



Figure 8-2. Memory-Mapped Timer Locations 
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8.1.1 Timer Global-Control Register 



The timer global control register is a 32-bit register that contains the global and 
port control bits for the timer module. Table 8-1 defines the register bits, 
names, and functions. Bits 3 — are the port control bits; bits 11 — 6 are the 
timer global control bits. Figure 8-3 shows the 32-bit register. Note that at re- 
set, all bits are set to except for DATIN (set to the value read on TCLK). 
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Figure 8-3. Timer Global-Control Register 
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Bits 


Name 


Reset Value 


Function 





FUNC : 





FUNC controls the function of TCLK. If FUNC = 0, TCLK is configured 
as a general-purpose digital I/O port. If FUNC = 1 , TCLK is configured 
as a timer pin (see Figure 8-7 for a description of the relationship 
between FUNC and CLKSRC). 


1 


I/O 





If FUNC = and CLKSRC = 0, TCLK is configured as a general-pur- 
pose I/O pin. In this case, if I/O = 0, TCLK is configured as a general- 
purpose input pin. If I/O = 1 , TCLK is configured as a general-purpose 
output pin. 


2 


DATOUT 





DATOUT drives TCLK when the TMS320C3x is in I/O port mode. 
DATOUT can also be used as an input to the timer. 


3 


DATIN 


xt 


Data input on TCLK or DATOUT. A write has no effect. 


5 — 4 


Reserved 


0-0 


Read as 0. 


6 


GO 





The GO bit resets and starts the timer counter. When GO = 1 and the 
timer is not held, the counter is zeroed and begins incrementing on the 
next rising edge of the timer input clock. The GO bit is cleared on the 
same rising edge. GO = has no effect on the timer. 


7 


FED 





Counter hold signal. When this bit is zero, the counter is disabled and 
held in its current state. If the timer is driving TCLK, the state of TCLK 
is also held. The internal divide-by-two counter is also held so that the 
counter can continue where it left off when HLD is set to 1 . The timer 
registers can be read and modified while the timer is being held. 
RESET has priority over HLD. Table 8-2 shows the effect of writing 
to GO and HLD. 


8 


C/P 





Clock/Pulse mode control. When C/P = 1 , clock mode is chosen, and 
the signaling of the status flag and external output will have a 50 
percent duty cycle. When C/P = 0, the status flag and external output 
will be active for one H1 cycle during each timer period (see 
Figure 8-4). 


9 


CLKSRC 





Specifies the source of the timer clock. When CLKSRC = 1 , an internal 
clock with frequency equal to one-half the H1 frequency is used to in- 
crement the counter. The INV bit has no effect on the internal clock 
source. When CLKSRC = 0, an external signal from the TCLK pin can 
be used to increment the counter. The external clock is synchronized 
internally, thus allowing external asynchronous clock sources that do 
not exceed the specified maximum allowable external clock frequen- 
cy. This will be less than f(H1 )/2. (See Figure 8-7 for a description of 
the relationship between FUNC and CLKSRC). 



t x = or 1 
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Table 8- 1. Timer Global-Control Register Bits Summary (Continued) 



Bits 


Name 


Reset Value 


Function 


10 


INV 





Inverter control bit. If an external clock source is used and INV = 1 , the 
external clock is inverted as it goes into the counter. If the output of the 
pulse generator is routed to TCLK and INV = 1 , the output is inverted 
before it goes to TCLK (see Figure 8-1). If INV = 0, no inversion is 
performed on the input or output of the timer. The INV bit has no effect, 
regardless of its value, when TCLK is used in I/O port mode. 


11 


TSTAT 





This bit indicates the status of the timer. It tracks the output of the 
uninverted TCLK pin. This flag sets a CPU interrupt on a transition from 
to 1 . A write has no effect. . 


31—12 


Reserved 


0-0 


Read as 0. 



Table 8-2 shows the result of a write using specified values of the GO and HLD 
bits in the global control register. 



Table 8-2. Result of a Write of Specified Values of GO and HLD 



GO 


HLD 


Result 








All timer operations are held. No reset is performed. (Reset value) 





1 


Timer proceeds from state before write. 


1 





All timer operations are held, including zeroing of the counter. The GO bit 
is not cleared until the timer is taken out of hold. 


1 


1 


Timer resets and starts. 
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8.1.2 Timer Period and Counter Registers 



The 32-bit timer period register is used to specify the frequency of the timer sig- 
naling. The timer counter register is a 32-bit register, which is reset to zero 
whenever it increments to the value of the period register. Both registers are 
set to at reset. 

Certain boundary conditions affect timer operation, such as a zero in the period 
register and an overflow of the counter. These conditions are listed as follows: 

Qi When the period and counter registers are zero, the operation of the_timer 
is dependent upon the C/P mode selected. In pulse mode (C/P = 0), 
TSTAT is set and remains set. In clock mode (C/P = 1), the width of the 
cycle is 2/f(H1), and the external clocks are ignored. 

Qi When the counter register is not and the period register = 0, the counter 
will count, roll over to 0, and then behave as described above. 

£1 When the counter register is set to a value greater than the period register, 
the counter may overflow when being incremented. Once the counter 
reaches its maximum 32-bit value (OFFFFFFFFh), it simply clocks over to 
and continues. 

Writes from the peripheral bus override register updates from the counter 
and new status updates to the control register. 



8.1.3 Timer Pulse Generation 



The timer pulse generator (see Figure 8-1) can generate several different ex- 
ternal signals. These signals may be inverted with the IN V bit. The two basic 
modes are pulse mode and clock mode, as shown in Figure 8-4. In both 
modes, an internal clock source has a frequency of f(H1)/2, and an external 
clock source has a maximum frequency of f(H1)/2.6. Refer to timer timing in 
Appendix A. In pulse mode (C/P = 0), the width of the pulse is 1/f(H1). 
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Figure 8-4. Timer Timing 
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(a) TSTAT and Timer Output (INV = 0) When C/P = (Pulse Mode) 



-H— 1/f(CLKSRC) 



2/f(H1) 



J 



r 



period register/f(CLKSRC) 



t 

TINT 



2 x period register/f(CLKSRC) 

t 

TINT 



t 

TINT 



(b) TSTAT and Timer Output (INV = 0) When C/P = 1 (Clock Mode) 

The rate of timer signaling is determined by the frequency of the timer input 
clock and the period register. The following equations are valid with either an 
internal or an external timer clock: 

f(pulse mode) = f(timer clock) / period register 

f(clock mode) = f(timer clock) / (2 x period register) 

Figure 8-5 provides some examples of the TCLKx output when the period reg- 
ister is set to various values and clock or pulse mode is selected. 
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Figure 8-5. Timer Output Generation Examples 
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(d) INV = 0, C/P = 1 (Clock Mode) 
Timer Period = 
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(e) INV = 0, C/P = 1 (Clock Mode) 
Timer Period = 1 
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(f) INV = 0, C/P = 1 (Clock Mode) 
Timer Period = 2 
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8.1.4 Timer Operation Modes 



The timer can receive its input and send its output in several different modes, 
depending upon the setting of CLKSRC, FUNC, and I/O. The four timer modes 
of operation are defined as follows: 

Qi If CLKSRC = 1 and FUNC = 0, the timer input comes from the internal 
clock. The internal clock is not affected by the INV bit. In this mode, TCLK 
is connected to the I/O port control and can be used as a general-purpose 
I/O pin (see Figure 8-6). If I/O = 0, TCLK is configured as a general-pur- 
pose input pin whose state can be read in DATIN. DATOUT has no effect 
on TCLK or DATIN. If 1/0 = 1, TCLK is configured as a general-purpose 
output pin. DATOUT is placed on TCLK and can be read in DATIN. 



Figure 8-6. Timer I/O Port Configurations 
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Qi If CLKSRC = 1 and FUNC = 1, the timer input comes from the internal 
clock, and the timer output goes to TCLK. This value may be inverted using 
INV, and the value output on TCLK can be read in DATIN. 

Q If CLKSRC = and FUNC = 0, the timer is driven according to the status 
of the I/O bit. If I/O = 0, the timer input comes from TCLK. This value can 
be inverted using INV, and the value of TCLK can be read in DATIN. If I/O 
= 1 , TCLK is an output pin. Then, TCLK and the timer are both driven by 
DATOUT. All 0-to-1 transitions of DATOUT increment the counter. INV has 
no effect on DATOUT. The value of DATOUT can be read in DATIN. 

□ If CLKSRC = and FUNC = 1 , TCLK drives the timer. If INV = 0, all 0-to-1 
transitions of TCLK increment the counter. If INV = 1 , all 1 -to-0 transitions 
of TCLK increment the counter. The value of TCLK can be read in DATIN. 
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Figure 8-7 shows the four timer modes of operation. 
Figure 8-7. Timer Modes as Defined by CLKSRC and FUNC 
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8.1.5 Timer Interrupts 



A timer interrupt is generated whenever the timer automatically resets the tim- 
er counter register to zero. The timer counter register is automatically reset to 
zero whenever it is equal to or greater than the value in the timer period register. 
The timer interrupt can be used to interrupt either the CPU or the DMA. Inter- 
rupt enable control for each timer, for either the CPU or the DMA, is found in 
the CPU/DMA interrupt enable register. Refer to subsection 3.1 .8 for more in- 
formation on the CPU/DMA interrupt enable register. 

When a timer interrupt occurs, a change in state of the corresponding TCLK 
pin will be observed if the FUNC = 1 and CLKSRC = 1 in the timer global-control 
register. The exact change of state depends on the state of the C/P bit. 
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8.1.6 Timer Initialization/Reconfiguration 



The timers are controlled through memory-mapped registers located on the 
dedicated peripheral bus. A general procedure for initializing and/or reconfi- 
guring the timers follows: 



1 ) Halt the timer by clearing the GO/HLD bits of the timer global-control regis- 
ter. This can be accomplished by writing a to the timer global-control reg- 
ister. Note that the timers are halted on RESET. 



2) Configure the timer via the timer global-control register (with GO = HLD 
= ), as well as the timer counter register and timer period register, if nec- 
essary. 



3) Start the timer by setting the GO/HLD bits of the timer global-control regis- 
ter. 
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8.2 Serial Ports 



The TMS320C30 has two totally independent bidirectional serial ports. Both 
serial ports are identical with a complementary set of control registers in each 
one. Only one serial port is available on the TMS320C31 . Each serial port can 
be configured to transfer 8, 1 6, 24, or 32 bits of data per word simultaneously 
in both directions. The clock for each serial port can originate either internally, 
via the serial port timer and period registers, or externally, via a supplied clock. 
An internally generated clock is a divide-down of the clockout frequency, f(H1 ). 
A continuous transfer mode is available, which allows the serial port to transmit 
and receive any number of words without new synchronization pulses. 

Eight memory-mapped registers are provided for each serial port: 

□i Global-control register 

Q Two control registers for the six serial I/O pins 

□ Three receive/transmit timer registers 

□ Data-transmit register 

.Q Data-receive register 

The global-control register controls the global functions of the serial port and 
determines the serial-port operating mode. Two port control registers control 
the functions of the six serial port pins. The transmit buffer contains the next 
complete word to be transmitted. The receive buffer contains the last complete 
word received. Three additional registers are associated with the transmit/re- 
ceive sections of the serial-port timer. A serial-port block diagram is shown in 
Figure 8-8, and the memory map of a serial port is shown in Figure 8-9. 
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Figure 8-8. Serial-Port Block Diagram 
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Figure 8-9. Memory-Mapped Locations for the Serial Port 
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8.2.1 Serial-Port Global-Control Register 



Peripheral Address 
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80805Fh 



The serial-port global-control register is a 32-bit register that contains the glob- 
al control bits for the serial port. Table 8-3 defines the register bits, bit names, 
and bit functions. The register is shown in Figure 8-10. 
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Figure 8-10. Serial-Port Global-Control Register 
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RAW R/W R/W R 



7ab/e 8-3. Serial-Port Global-Control Register Bits Summary 



Bit 


Name 


Reset Value 


Function 





RRDY 





If RRDY = 1 , the receive buffer has new data and is ready to be read. A three H1/H3 
cycle delay occurs from the reading of DRR to RRDY = 1 . The rising edge of this sig- 
nal sets RINT. If RRDY= at reset, the receive buffer does not have new data since 
the last read. RRDY = at reset and after the receive buffer is read. 


1 


XRDY 


1 


If XRDY = 1 , the transmit buffer has written the last bit of data to the shifter and is 
ready for a new word. A three H1 /H3 cycle delay occurs from the loading of the trans- 
mitshifteruntilXRDYissettol. The rising edgeofthissignalsetsXINT. If XRDY = 0, 
the transmit buffer has not written the last bit of data to the transmit shifter and is not 
ready for a new word. XRDY = 1 at reset. 


2 


FSXOUT 





This bit configures the FSX pin as an input (FSXOUT= 0) or an output (FSXOUT = 1 ). 


3 


XSREMPTY 





If XSREMPTY = 0, the transmit shift register is empty. If XSREMPTY = 1 , the transmit 
shift register is not empty. Reset or XRESET causes this bit to = 0. 


4 


RSRFULL 





If RSRFULL = 1 , an overrun of the receiver has occurred. In continuous mode, 
RSRFULL is set to 1 when both RSR and DRR are full. In noncontinuous mode, 
RSRFULL is setto 1 when RSR and DRR are full and a new FSR is received. A read 
causes this bit to be set to 0. This bit can be set to only by a system reset, a serial 
port receive reset (RRESET = 1 ), or a read. When the receiver tries to set RSRFULL 
to a 1 at the same time that the global register is read, the receiver will dominate and 
RSRFULL is set to 1 . If RSRFULL = 0, no overrun of the receiver has occurred. 


5 


HS ' 





If HS = 1 , the handshake mode is enabled. If HS = 0, the handshake mode is disabled. 


6 


XCLKSRCE 





If XCLKSRCE = 1 , the internal transmit clock is used. If XCLKSRCE = 0, the external 
transmit clock is used. 


7 


RCLKSRCE 





If RCLKSRCE = 1 , the internal receive clock is used. If RCLKSRCE = 0, the external 
receive clock is used. 


8 


XVAREN 





This bit specifies fixed (XVAREN = 0) or variable (XVAREN = 1) data rate signaling 
when transmitting. With a fixed data rate, FSX is active for at least one XCLK cycle 
and then goes inactive before transmission begins. With variable data rate, FSX is 
active while all bits are being transmitted. When you use an external FSX and vari- 
able data rate signaling, the DX pin is driven by the transmitter when FSX is held ac- 
tive or when a word is being shifted out. 


9 


RVAREN 





This bit specifies fixed (RVAREN = 0) or variable (RVAREN = 1 ) data rate signaling 
when receiving. With a fixed data rate, FSR is active for at least one RCLK cycle and 
then goes inactive before the reception begins. With variable data rate, FSR is active 
while all bits are being received. 
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Table 8-3. Serial-Port Global-Control Register Bits Summary (Continued) 



Bit 


Name 


Reset Value 


Function 


10 


XFSM 





Transmit frame sync mode. Configures the port for continuous mode opera- 
tion(XFSM = 1 ) or standard mode (XFSM = 0). In continuous mode, only the first word 
of a block generates a sync pulse, and the rest are simply transmitted continuously 
to the end of the block. In standard mode, each word has an associated sync pulse. 


11 


RFSM 





Receive frame sync mode. Configures the port for continuous mode (RFSM =1) or 
standard mode (RFSM = 0) operation. In continuous mode, only the first word of a 
block generates a sync pulse, and the rest are simply received continuously without 
expectation of another sync pulse. In standard mode, each word received has an 
associated sync pulse. 


12 


CLKXP 





CLKX polarity. If CLKXP = 0, CLKX is active high. If CLKXP = 1 , CLKX is active low. 


13 


CLKRP 





CLKR polarity. If CLKRP = 0, CLKR is active high. If CLKRP =1 , CLKR is active low. 


14 


DXP 





DX polarity. If DXP = 0, DX is active high. If DXP = 1 , DX is active low. 


15 


DRP 





DR polarity. If DRP = 0, DR is active high. If DRP = 1 , DR is active low. 


16 


FSXP 





FSX polarity. If FSXP = 0, FSX is active high. If FSXP = 1 , FSX is active low. 


17 


FSRP 





FSR polarity. If FSRP = 0, FSR is active high. If FSRP = 1 , FSR is active low. 


19 — 18 


XLEN 


00 


These two bits define the word length of serial data transmitted. All data is assumed 
to be right-justified in the transmit buffer when fewer than 32 bits are specified. 

0—8 bits 1 — 24 bits 

1 — 16 bits 1 1 — 32 bits 


21 —20 


RLEN 


00 


These two bits define the word length of serial data received. All data is right-justified 
in the receive buffer. 

0—8 bits 1 — 24 bits 

1 — 16 bits 1 1 — 32 bits 


22 


XTINT 





Transmit timer interrupt enable. If XTINT = 0, the transmit timer interrupt is disabled. 
If XTINT = 1 , the transmit timer interrupt is enabled. 


23 


XINT 





Transmit interrupt enable. If XINT = 0, the transmit interrupt is disabled. If XINT= 1 , 
the transmit interrupt is enabled. Note that the GPU transmit interrupt flag XINT is 
the logical OR of the enabled transmit timer interrupt and the enabled transmit inter- 
rupt. 


24 


RTINT 





Receive timer interrupt enable. If RTINT = 0, the receive timer interrupt is disabled. 
If RTINT = 1 , the receive timer interrupt is enabled. 


25 


RINT 





Receive interrupt enable. If RINT = 0, the receive interrupt is disabled. If RINT= 1 , 
the receive interrupt is enabled. Note that the CPU receive interrupt flag RINT is the 
OR of the enabled receive timer interrupt and the enabled receive interrupt. 


26 


XRESET 





Transmit reset. If XRESET = 0, the transmit side of the serial port is reset. To take 
the transmit side of the serial port out of reset, set XRESET to 1 . However, do not 
set XRESET to 1 until at least three cycles after XRESET goes inactive. This applies 
only to system reset. Setting XRESET to does not change the contents of any of 
the serial-port control registers. It places the transmitter in a state corresponding to 
the beginning of a frame of data. Resetting the transmitter generates a transmit inter- 
rupt. Reset this bit during the time the mode of the transmitter is set. XFSM can be 
toggled without resetting the global-control register. 
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Table 8-3. Serial-Port Global-Control Register Bits Summary (Concluded) 



Bit 


Name 


Reset Value 


Function 


27 


RRESET 





Receive reset. If RRESET = 0, the receive side of the serial port is reset. To take 
the receive side of the serial port out of reset, set RRESET to 1 . Setting RRESET 
to does not change the contents of any of the serial-port control registers. It places 
the receiver in a state corresponding to the beginning of a frame of data. Reset this 
bit at the same time the mode of the receiver is set. RFSM can be toggled without 
resetting the global-control register. 


31 —28 


Reserved 


0-0 


Read as 0. 



8.2.2 FSX/DX/CLKX Port Control Register 



This 32-bit port control register controls the function of the serial port FSX, DX, 
and CLKX pins. At reset, all bits are set to 0. Table 8-4 defines the register bits, 
bit names, and functions. Figure 8-11 shows this port control register. 



Figure 8-11. FSX/DX/CLKX Port Control Register 
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xx = reserved bit, read as 0. 
R = read, W = write. 



RAW R/W. 



RAW 



Table 8-4. FSX/DX/CLKX Port Control Register Bits Summary 



Bit 


Name 


Reset Value 


Function 





CLKXFUNC 





CLKXFUNC controls the function of CLKX. If CLKXFUNC = 0, CLKX is confi- 
gured as a general-purpose digital I/O port. If CLKXFUNC = 1 , CLKX is a serial 
port pin. 


1 


CLKXl/O 





]f CLKX I/O = 0, CLKX is configured as a general-purpose input pin. If CLKX 
T/O = 1 , CLKX is configured as a general-purpose output pin. 


2 


CLKXDATOUT 





Data output on CLKX. 


3 


CLKXDATIN 


X 


Data input on CLKX. A write has no effect. 


4 


DXFUNC 





DXFUNC controls the function of DX. If DXFUNC = 0, DX is configured as a 
general-purpose digital I/O port. If DXFUNC = 1 , DX is a serial port pin. 


5 


DXl/O 





If DX I/O = 0, DX is configured as a general-purpose input pin. If DX 1/0 = 1, 
DX is configured as a general-purpose output pin. 


6 


DXDATOUT 





Data output on DX. 


7 


DXDATIN 


Xt 


Data input on DX. A write has no effect. 


8 


FSXFUNC 





FSXFUNC controls the function of FSX. If FSXFUNC = 0, FSX is configured 
as a general-purpose digital I/O port. If FSXFUNC = 1 , FSX is a serial port pin. 



t x = or 1 
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Table 8-4. FSX/DX/CLKX Port Control Register Bits Summary (Continued) 



Reset Value 


Bit 


Name 


Function 





9 


FSX I/O 


If FSX j/O = 0, FSX is configured as a general-purpose input pin. 
If FSX 1/0 = 1, FSX is configured as a general-purpose output pin. 





10 


FSXDATOUT 


Data output on FSX. 


xt 


11 


FSXDATIN 


Data input on FSX. A write has no effect. 


0-0 


31—12 


Reserved 


Read as 0. 



t x = or 1 



8.2.3 FSR/DR/CLKR Port Control Register 



This 32-bit port control register is controlled by the function of the serial port 
FSR, DR, and CLKR pins. At reset, all bits are set to 0. Table 8-5 defines the 
register bits, the bit names, and functions. Figure 8-1 2 illustrates this port con- 
trol register. 



Figure 8-12. FSR/DR/CLKR Port Control Register 
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NOTE: 



xx = reserved bit, read as 0. 
R = read, W = write. 



RAV 



Table 8-5. FSR/DR/CLKR Port Control Register Bits Summary 



Bit 


Name 


Reset Value 


Function 





CLKRFUNC 





CLKRFUNC controls the function of CLKR. If CLKRFUNC = 0, CLKR is 
configured as a general-purpose digital I/O port. If CLKRFUNC = 1 , 
CLKR is a serial port pin. 


1 


CLKRl/O 





If CLKRj/O = 0, CLKR is configured as a general-purpose input pin. 
If CLKRl/O = 1 , CLKR is configured as a general-purpose output pin. 


2 


CLKRDATOUT 





Data output on CLKR. 


3 


CLKRDATIN 


x 


Data input on CLKR. A write has no effect. 


4 


DRFUNC 





DRFUNC controls the function of DR. If DRFUNC = 0, DR is configured 
as a general-purpose digital I/O port. If DRFUNC = 1 , DR is a serial port 
pin. 


5 


DR. I/O 





If DRl/O = 0, DR is configured as a general-purpose input pin. 
If DRl/O = 1 , DR is configured as a general-purpose output pin. 


6 


DRDATOUT 





Data output on DR. 


7 


DRDATIN 


xt 


Data input on DR. A write has no effect. 


8 


FSRFUNC 





FSRFUNC controls the function of FSR. If FSRFUNC = 0, FSR is confi- 
gured as a general-purpose digital I/O port. If FSRFUNC = 1 , FSR is a 
serial port pin. 



t x = or 1 
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Table 8-5. FSR/DR/CLKR Port Control Register Bits Summary (Continued) 



Bit 


Name 


Reset Value 


Function 


9 


FSR I/O 





If FSR I/O = 0, FSR is configured as a general-purpose input pin. 
If FSR I/O = 1 , FSR is configured as a general-purpose output pin. 


10 


FSRDATOUT 





Data output on FSR. 


11 


FSRDATIN 


X 


Data input on FSR. A write has no effect. 


31 —12 


Reserved 


0-0 


Read as 0. 



8.2.4 Receive/Transmit Timer Control Register 



A 32-bit receive/transmit timer control register contains the control bits for the 
timer module. At reset, all bits are set to 0. Table 8-6 lists the register bits, bit 
names, andfunctions.Bits5 — control the transmitter timer. Bits 11 — 6con- 
trol the receiver timer. Figure 8-1 3 shows the register. The serial port receive/ 
transmit timer function is similar to timer module operation. It can be consid- 
ered as a 1 6-bit-wide timer. Refer to Section 8.1 for more information on timers. 



Figure 8- 13. Receive/Transmit Timer Control Register 

31 30 29 28 27 26 25 24 23 



22 



20 



19 



XX 


XX 




XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


15 


14 


13 


12 


11 


10 


9 


8 


7 


6 


5 


4 


3 


2 


1 





XX 


XX 


XX 


XX 


RTSTAT 


XX 


RCLKRC 


RC/P 


RHLD 


RGO 


XTSTAT 


XX 


XCLKSRC 


XC/P 


XHLD 


XGO 



R/W 



R/W 



R/W 



R/W 



R/W 



NOTE: xx = reserved bit, read as 0. 

R = read, W = write. 



Table 8-6. Receive/Transmit Timer Control Register 



Bit 


Name 


Reset Value 


Function 





XGO 





The XGO bit resets and starts the transmit timer counter. When XGO 
is set to 1 and the timer is not held, the counter is zeroed and begins 
incrementing on the next rising edge of the timer input clock. The XGO 
bit is cleared on the same rising edge. Writing to XGO has no effect 
on the transmit timer. 


1 


XHLD 





Transmit counter hold signal. When this bit is set to 0, the counter is dis- 
abled and held in its current state. The internal divide-by-two counter 
is also held so that the counter will continue where it left off when XHLD 
is set to 1 . The timer registers may be read and modified while the timer 
is being held. RESET has priority over XHLD. 


v 2 


XC/P 





XCIock/Pulse mode control. When XC/P = 1 .theclockmodeis chosen. 
The signaling of the status flag and external output has a 50-percent 
duty cycle. When XC/P = 0, the status flag and external output are active 
for one CLKOUT-cycle during each timer period. 
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Table 8-6. Receive/Transmit Timer Control Register (Concluded) 



Bit 


Name 


Reset Value 


Function 


3 


XCLKSRC 





This bit specifies the source of the transmit timer clock. When 
XCLKSRC = 1 , an internal clock with frequency equal to one-half the 
CLKOUTfrequency is used to increment the counter. When XCLKSRC 
= 0, an external signal from the CLKX pin can be used to increment the 
counter. The external clock source is SYNChronized internally, thus al- 
lowing for external aSYNChronous clock sources that do not exceed 
the specified maximum allowable external clock frequency, i.e., less 
thanf(H1)/2.6. 


4 


Reserved 





Read as zero. 


5 


XTSTAT 





This bit indicates the status of the transmit timer. It tracks what would 
be the output of the uninverted CLKX pin. This flag sets a CPU interrupt 
on a transition from to 1 . A write has no effect. 


6 


RGO 





The RGO bit resets and starts the receive timer counter. When RGO 
is set to 1 and the timer is not held, the counter is zeroed and begins 
incrementing on the next rising edge of the timer input clock. The RGO 
bit is cleared on the same rising edge. Writing to RGO has no effect 
on the receive timer. 


7 


RHLD 





Receive counter hold signal. When this bit is set to 0, the counter is dis- 
abled and held in its current state. The internal divide-by-two counter 
is also held so that the counter will continue where it left off when RHLD 
is set to 1 . The timer registers may be read and modified while the timer 
is being held. RESET has priority over RHLD. 


8 


RC/P 





RCIock/Pulse mode control. When RC/P = 1 , the clock mode is chosen. 
The signaling of the status flag and external output has a 50-percent 
duty cycle. When RC/P = 0, the status flag and external output are ac- 
tive for one CLKOUT cycle during each timer period. 


9 


RCLKSRC 





This bit specifies the source of the receive timer clock. When RCLKSRC 
= 1 , an internal clock with frequency equal to one-half the CLKOUTfre- 
quency is used to increment the counter. When RCLKSRC = 0, an ex- 
ternal signal from the CLKR pin can be used to increment the counter. 
The external clock source is SYNChronized internally, thus allowing for 
external aSYNChronous clock sources that do not exceed the specified 
maximum allowable external clock frequency, i.e., less than f(H1)/2.6. 


10 


Reserved 





Read as. zero. 


11 


RTSTAT 





This bit indicates the status of the receive timer. It tracks what would be 
the output of the uninverted CLKR pin. This flag sets a CPU interrupt 
on a transition from to 1 . A write has no effect. 


31—12 


Reserved 


0-0 


Read as 0. 
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8.2.5 Receive/Transmit Timer Counter Register 

The receive/transmit timer counter register is a 32-bit register (see 
Figure 8-14). Bits 15 — are the transmit timer counter, andbits31 — 16are 
the receive timer counter. Each counter is set to whenever it increments to 
the value of the period register (Section 8.2.6). It is also set to at reset. 

Figure 8-14. Receive/Transmit Timer Counter Register 
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NOTE: All bits are read/write. 



8.2.6 Receive/Transmit Timer Period Register 

The receive/transmit timer period register is a 32-bit register (see 
Figure 8-1 5). Bits 15 — are the timer transmit period, and bits 31 — 16 are 
the receive period. Each register is used to specify the period of the timer. It 
is also set to at reset. 

Figure 8- 15. Receive/Transmit Timer Period Register 
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NOTE: All bits are read/write. 



8.2.7 Data-Transmit Register 



When the data-transmit register (DXR) is loaded, the transmitter loads the 
word into the transmit shift register (XSR), and the bits are shifted out. The 
delay from a write to DXR until an FSX occurs (or can be accepted) is two CLKX 
cycles. The word is not loaded into the shift register until the shifter is empty. 
When DXR is loaded into XSR, the XRDY bit is set, specifying that the buffer 
is available to receive the next word. Four tap points within the transmit shift 
register are used to transmit the word. These tap points correspond to the four 
data word sizes and are illustrated in Figure 8-1 6 . The shift is a left-shift (LSB 
to MSB) with the data shifted out of the MSB corresponding to the appropriate 
tap point. 



8-21 



Serial Ports 



Figure 8-16. Transmit Buffer Shift Operation 
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8.2.8 Data-Receive Register 



When serial data is input, the receiver shifts the bits into the receive shift regis- 
ter (RSR). When the specified number of bits are shifted in, the data-receive 
register (DRR) is loaded from RSR, and the RRDY status bit is set. The receiv- 
er is double-buffered. If the DRR has not been read and the RSR is full, the re- 
ceiver is frozen. New data coming into the DR pin is ignored. The receive shifter 
will not write over the DRR. The DRR must be read to allow new data in the 
RSR to be transferred to the DRR. When a write to DRR occurs at the same 
time that a RSR to DRR transfer takes place, the RSR to DRR transfer has 
priority. 

Data is shifted to the left (LSB to MSB). Figure 8-17 illustrates what happens 
when words less than 32 bits are shifted into the serial port. In this figure, it is 
assumed that an 8-bit word is being received and that the upper three bytes 
of the receive buffer are originally undefined. In the first portion of the figure, 
byte a has been shifted in. When byte b is shifted in, byte a is shifted to the left. 
When the data receive register is read, both bytes a and b are read. 



Figure 8-17. Receive Buffer Shift Operation 
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8.2.9 Serial-Port Operation Configurations 



Several configurations are provided for the operation of the serial port clocks 
and timer. The clocks for each serial port can originate either internally or exter- 
nally. Figure 8-1 8 shows serial port clocking in the I/O mode (FUNC = 0) when 
CLKX is either an input or an output. Figure 8-1 9 shows clocking in the serial- 
port mode (FUNC = 1 ). Both figures use a transmit section for an example. The 
same relationship holds for a receive section. 



Figure 8- 18. Serial- Port Clocking in I/O Mode 
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Figure 8-19. Serial-Port Clocking in Serial-Port Mode 
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8.2.10 Serial-Port Timing 



The formula for calculating the frequency of the serial-port clock with an inter- 
nally generated clock is dependent upon the operation mode of the serial-port 
timers, defined as 

f (pulse mode) = f (timer clock)/period register 

f (clock mode) = f (timer clock)/(2 x period register) 

An externally generated serial-port clock (CLKX or CLKR) has a maximum fre- 
quency of less than f(H1 )/2.6. See serial port timing in Chapter 1 3. Also, see 
subsection 8.1 .3 for information on timer pulse/clock generation. 

Transmit data is clocked out on the rising edge of the selected serial-port clock. 
Receive data is latched into the receive shift register on the falling edge of the 
serial-port clock. All data is transmitted and loaded MSB first and right-justified. 
If fewer than 32 bits are transferred, the data are right-justified in the 32-bit 
transmit and receive buffers. Therefore, the LSBs of the transmit buffer are 
the bits that are transmitted. 
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The transmit ready (XRDY) signal specifies that the data-transmit register 
(DXR) is available to be loaded with new data. XRDY goes active as soon as 
the data is loaded into the transmit shift register (XSR). The last word may still 
be shifting out when XRDY goes active. If DXR is loaded before the last word 
has completed transmission, the data bits transmitted will be consecutive; i.e., 
the LSB of the first word immediately precedes the MSB of the second, with 
all signaling valid as in two separate transmits. XRDY goes inactive when DXR 
is loaded and remains inactive until the data is loaded into the shifter. 

The receive ready (RRDY) signal is active as long as a new word of data is 
loaded into the data receive register and has not been read. As soon as the 
data is read, the RRDY bit is turned off. 

When FSX is specified as an output, the activity of the signal is determined 
solely by the internal state of the serial port. If a fixed data rate is specified, FSX 
goes active when DXR is loaded into XSR to be transmitted out. One serial- 
clock cycle later, FSX turns inactive, and data transmission begins. If a variable 
data rate is specified, the FSX pin is activated when the data transmission be- 
gins, and remains active during the entire transmission of the word. Again, the 
data is transmitted one clock cycle after it is loaded into the data transmit regis- 
ter. 

An input FSX in the fixed data rate mode should go active for at least one serial 
clock cycle and then inactive to initiate the data transfer. The transmitter then 
sends the number of bits specified by the LEN bits. In the variable data-rate 
mode, the transmitter begins sending from the time FSX goes active until the 
number of specified bits has been shifted out. In the variable data-rate mode, 
when the FSX status changes prior to all the data bits being shifted out, the 
transmission completes, and the DX pin is placed in a high-impedance state. 
An FSR input is exactly complementary to the FSX. 

When using an external FSX, if DXR and XSR are empty, a write to DXR results 
in a DXR-to-XSR transfer. This data is held in the XSR until an FSX occurs. 
When the external FSX is received, the XSR begins shifting the data. If XSR 
is waiting for the extemaf FSX, a write to DXR will change DXR, but a DXR-to- 
XSR transfer will not occur. XSR begins shifting when the external FSX is re- 
ceived, or when it is reset using XRESET 

Continuous Transmit and Receive Modes 

When continuous mode is chosen, consecutive writes do not generate or ex- 
pect new sync pulse signaling. Only the first word of a block begins with an ac- 
tive synchronization. Thereafter, data continues to be transmitted as long as 
new data is loaded into DXR before the last word has been transmitted. As 
soon as TXRDY is active and all of the data has been transmitted out of the 
shift register, the DX pin is placed in a high-impedance state, and a subsequent 
write to DXR initiates a new block and a new FSX. 

Similarly with FSR, the receiver continues shifting in new data and loading 
DRR. If the data-receive buffer is not read before the next word is shifted in, 
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subsequent incoming data will be lost. The RFSM bit can be used to terminate 
the receive-continuous mode. 

Handshake Mode 

The handshake mode (HS = 1) allows for direct connection between proces- 
sors. In this mode, all data words are transmitted with a leading 1 (see 
Figure 8-20). For example, if an 8-bit word is to be transmitted, the first bit sent 
is a 1 , followed by the 8-bit data word. 

In this mode, once the serial port transmits a word, it will not transmit another 
word until it receives a separately transmitted zero bit. Therefore, the 1 bit that 
precedes every data word is, in effect, a request bit. 

Figure 8-20. Data Word Format in Handshake Mode 
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After a serial port receives a word (with the leading 1 ) and that word has been 
read from the DRR, the receiving serial port sends a single to the transmitting 
serial port. Thus, the single bit acts as an acknowledge bit (see Figure 8-21 ). 
This single acknowledge bit is sent every time the DRR is read, even if the DRR 
does not contain new data. 



Figure 8-21. Single Zero Sent as an Acknowledge 
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When the serial port is placed in the handshake mode, the insertion and dele- 
tion of a leading 1 for transmitted data, the sending of a for acknowledgement 
of received data, and the waiting for this acknowledge bit are all performed au- 
tomatically. Using this scheme, it is simple to connect processors with no exter- 
nal hardware and to guarantee secure communication. Atypical configuration 
is shown in Figure 8-22. 

In the handshake mode, FSX is automatically configured as an output. Contin- 
uous mode is automatically disabled. After a system reset or XRESET, the 
transmitter is always permitted to transmit. The transmitter and receiver must 
be reset when entering the handshake mode. 
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Figure 8-22. Direct Connection Using Handshake Mode 
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8.2.11 Serial-Port Interrupt Sources 

A serial port has four interrupt sources: 

1 ) The transmit timer interrupt: The rising edge of XTSTAT causes a single- 
cycle interrupt pulse to occur. When XTINT is 0, this interrupt pulse is dis- 
abled. 

2) The receive timer interrupt: The rising edge of RTSTAT causes a single- 
cycle interrupt pulse to occur. When RTINT is 0, this interrupt pulse is dis- 
abled. 

3) The transmitter interrupt: Occurs immediately following a DXR-to-XSR 
transfer. The transmitter interrupt is a single-cycle pulse. When the 
serial-port global-control register bit XINT is 0, this interrupt pulse is dis- 
abled. 

4) The receiver interrupt: Occurs immediately following a RSR to DRR trans- 
fer. The receiver interrupt is a single-cycle pulse. When the serial-port glo- 
bal-control register bit RINT is 0, this interrupt pulse is disabled. 

The transmit timer interrupt pulse is ORed with the transmitter interrupt pulse to create 
the CPU transmit interrupt flag XINT. The receive timer interrupt pulse is ORed with the 
receiver interrupt pulse to create the CPU receive interrupt flag RINT. 
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8.2.12 Serial-Port Functional Operation 



The following paragraphs and figures illustrate the functional timing of the vari- 
ous serial-port modes of operation. The timing descriptions are presented with 
the assumption that all signal polarities are configured to be positive, i.e., 
CLKXP = CLKRP = DXP = DRP = FSXP = FSRP = 0. Logical timing, in situa- 
tions where one or more of these polarities are inverted, is the same except 
with respect to the opposite polarity reference points, i.e. rising vs. falling 
edges, etc. 

These discussidns pertain to the numerous operating modes and configura- 
tions of the serial-port logic. When it is necessary to switch op erating mo des 
or change configurations of the serial port, do this only when XRESET or 
RRESET are asserted (low), as a ppropriate. Therefore, when transmit config- 
urations are modifi ed, XRES ET should be low, and when receive configura- 
tions are modified, RRESET should be low. When you use handshake mode, 
however, since the transmitter a nd receive r ar e interrelat ed, you should make 
any configuration changes with XRESET and RRESET both low. 

All of the serial-port operating configurations can be broadly classified in two 
categories: fixed data-rate timing and variable data-rate timing. The following 
paragraphs discuss fixed and variable data-rate operation and all of their varia- 
tions. 

Fixed Data-Rate Timing Operation 

Fixed data-rate serial-port transfers can occur in two varieties: burst mode and 
continuous mode. In burst mode operation, transfers of single words are sepa- 
rated by periods of inactivity on the serial port. In continuous mode, there are 
no gaps between successive word transfers; the first bit of a new word is trans- 
ferred on the next CLKX/R pulse following the last bit of the previous word. This 
occurs continuously until the process is terminated. 

In burst mode with fixed data-rate timing, FSX/FSR pulses initiate transfers, 
and each transfer involves a single word. With an internally generated FSX 
(see Figure 8-23), transmission is initiated by loading DXR. In this mode, there 
is a delay of approximately 2.5 CLKX cycles (depending on CLKX and H1 fre- 
quencies) from the time DXR is loaded until FSX occurs. With an external FSX, 
the FSX pulse initiates the transfer, and the 2.5-cycle delay effectively be- 
comes a setup requirement for loading DXR with respect to FSX. Therefore, 
in this case, DXR must be loaded no later than 3 CLKX cycles before FSX oc- 
curs. Once the XSR is loaded from the DXR, an XINT is generated. 
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Figure 8-23. Fixed Burst Mode 
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In receive operations, once a transfer is initiated, FSR is ignored until the last 
bit. For burst mode transfers, FSR must be low during the last bit, or another 
transfer will be initiated. After a full word has been received and transferred to 
the DRR, an RINT is generated. 

In fixed data rate mode, continuous transfers may be performed even if 
R/XFSM = 0,as long as properly timed frame synchronization is provided, or 
if DXR is reloaded each cycle with an internally generated FSX (see 
Figure 8-24). 



Figure 8-24. Fixed Continuous Mode With Frame Sync 
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For receive operations and with externally generated FSX, once transfers have 
begun, frame sync pulses are required only during the last bit transferred to 
initiate another contiguous transfer. Otherwise, frame sync inputs are ignored. 
Therefore, continuous transfers will occur if frame sync is held high. With an 
internally generated FSX, there is a delay of approximately 2.5 CLKX cycles 
from the time DXR is loaded until FSX occurs. This delay occurs each time 
DXR is loaded; therefore, during continuous transmission, the instruction that 
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loads DXR must be executed by the N-3 bit for an A/-bit transmission. Since 
delays due to pipelining may vary, a conservative margin of safety should be 
incorporated in allowing for this delay. 

Once the process begins, an XINT and an RINT are generated atthe beginning 
of each transfer. The XINT indicates that the XSR has been loaded from DXR 
and can be used to cause DXR to be reloaded. To maintain continuous trans- 
mission in this mode, especially with an internally generated FSX, DXR must 
be reloaded early in the ongoing transfer. 

The RINT indicates that a full word has been received and transferred into the 
DRR. RINT is therefore commonly used to indicate an appropriate time to read 
DRR. 

Continuous transfers are terminated by discontinuing frame sync pulses or, 
in the case of internally generated FSX, not reloading DXR. 

Continuous serial-port transfers can be accomplished without the use of frame 
sync pulses if R/XFSM are set to one. In this mode, operation of the serial port 
is similar to continuous operation with frame sync except that a frame sync 
pulse is involved only in the first word transferred, and no further frame sync 
pulses are used. Following the first word transferred (see Figure 8-25), no in- 
ternal frame sync pulses are generated, and frame sync inputs are ignored. 
Additionally, R/XFSM should be set prior to or during the first word transferred 
and must be set no later than the transfer of the AM bit of the first word, except 
for transmit operations. For transmit operations in the fixed data-rate mode, 
XFSM must be set no later than the /V-2 bit. Clearing R/XFSM must be per- 
formed no later than the AM bit to be recognized in the current cycle. 



Figure 8-25. Fixed Continuous Mode Without Frame Sync 
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Timing of RINT and XINT and data transfers to and from DXR and DRR, re- 
spectively, are the same as in fixed data-rate continuous mode with frame 
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sync. This mode of operation also exhibits the same delay of 2.5 CLKX cycles 
after DXR is loaded before an internal FSX is generated. As in the case of con- 
tinuous operation in fixed data-rate mode with frame sync, DXR must be re- 
loaded no later than transmission of the A/-3 bit. 

When you use continuous operation in fixed data-rate mode, R/XFSM may be 
set and cleared as desired, even during active transfers, to enable or disable 
the use of frame sync pulses as dictated by system requirements. Under most 
conditions, the effect of changing the state of R/XFSM occurs during the trans- 
fer in which the R/XFSM change was made, provided the change was made 
early enough in the transfer. For transmit operations with internal FSX in fixed 
data-rate mode, however, a one-word delay occurs before frame sync pulse 
generation resumes when clearing XFSM to zero (see Figure 8-26). There- 
fore, one additional word is transferred in this case before the next FSX pulse 
is generated. Also note that, as discussed previously, clearing XFSM will be 
recognized during the transmission of the current word being transmitted as 
long as XFSM is cleared no later than the AM bit. Setting XFSM is recognized 
as long as XFSM is set no later than the A/-2 bit. 



Figure 8-26. Exiting Fixed Continuous Mode Without Frame Sync, FSX Internal 



I II I I I 

I 1st Word I 2nd Word I 3rd Word I 4th Word I 5th Word I 

II I I I I 

clkx JTJTJTJTJTJ\j^^ 

i i i i i 



FSX 
(Internal) 

DX 



LOAD DXR 



J~L 



J - L 



n n 



^EiX^£<AN)b^ 



SET XFSM RESET XFSM 



Variable Data-Rate Timing Operation 

Variable data-rate timing also supports operation in either burst or continuous 
mode. Burst mode operation with variable data-rate timing is similar to burst 
mode operation with fixed data rate timing. With variable data-rate timing (see 
Figure 8-27), however, FSX/R and data timing differ slightly at the beginning 
and end of transfers. Specifically, there are three major differences between 
fixed and variable data-rate timing: 

1) FSX/R pulses typically last for the entire transfer interval, although FSR 
and external FSX are ignored after the first bit transferred. FSX/R pulses 
in fixed data-rate mode typically last only one CLKX/R cycle but can last 
longer. 



8-31 



Serial Ports 



2) Data transfer begins during the CLKX/R cycle in which FSX/R occurs, rath- 
er than the CLKX/R cycle following FSX/R, as is the case with fixed data- 
rate timing. 

3) With variable data-rate timing, frame sync inputs are ignored until the end 
of the last bit transferred, rather than the beginning of the last bit trans- 
ferred as is the case with fixed data-rate timing. 



Figure 8-27. Variable Burst Mode 
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When you transmit continuously in variable data-rate mode with frame sync, 
timing is the same as for fixed data-rate mode, except for the differences be- 
tween these two modes as described under burst mode operation with variable 
data-rate timing. The only other exception to this is that DXR must be reloaded 
no later than the A/-4 bit to maintain continuous operation of the variable data- 
rate mode (see Figure 8-28); no later than the A/-3 bit for fixed data-rate 
mode. 



Figure 8-28. Variable Continuous Mode With Frame Sync 
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Continuous operation in variable data rate mode without frame sync is also 
similarto continuous operation without frame sync in fixed data-rate mode. As 
with variable data-rate mode continuous operation with frame sync (see 
Figure 8-29), DXR must be reloaded no later than the AM- bit to maintain con- 
tinuous operation. Additionally, when R/XFSM is set or cleared in the variable 
data-rate mode, the modification must be made no later than the AM bit for 
the result to be affected in the current transfer. 



Figure 8-29. Variable Continuous Mode Without Frame Sync 
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8.2.13 TMS320C3X Serial Port Interface Examples 
8.2. 13. 1 Handshake Mode Example 



When handshake mode is used, both the transmit (FSX/DS/CLKX) and re- 
ceive (FSR/DR/CLKR) signals are used to transmit and receive data, respec- 
tively. In other words, even if the TMS320C3x serial port is receiving data only 
with handshake mode, the transmit signals are still needed to transmit the ac- 
knowledge signal. The serial port registers, setup for the TMS320C3x serial 
port handshake communication, as shown in Figure 8-22 are shown below: 



Global control 
Transmit port control 
Receive port control 
S_port timer control 
S_port timer count 
S_port timer period 

Note: x = user configurable. 



01 1 xOxOxxxxOOOOOOOOxxd 1 001 00b 

0111h 

0111h 

OFh 

Oh 

01 h (if two C3xs have the same 

system clock) 



Since the FSX is set as an output and continuous mode is disabled when hand- 
shake mode is selected, the SFSM and RFSM bits should be set to and the 
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FSXOUT bit should be set to 1 in the global control register. The XRESET, 
RRESET, and HS bits should also be set to 1 in order to start the handshake 
communication. It is recommended that the polarity of the serial port pins be 
set to active high for simplification. Although the CLKX/CLKR can be set as ei- 
ther input or output, it is recommended to set the CLKX as output and the CLKR 
as input. The rest of the bits are user configurable as long as both serial ports 
have the consistent setup. 

The serial port timer is needed only if the CLKX or CLKR is configured as an 
output. In the above case, since only the CLKX is configured as an output, the 
timer control register should be set to OFh. When the serial port timer is used, 
the serial timer period register must also be set to the proper value for the clock 
speed. The serial port timer clock speed setup is similar to the TMS320C3x tim- 
er. Refer to Section 8.1 for detailed information on timer clock generation. 

The maximum clock frequency for serial transfers is F(CLKIN)/4 if the internal 
clock is used and F(CLKIN)/5.2 if an external clock is used. Therefore, if two 
TMS320C3xs have the same system clock, as in the the case above., the timer 
period register should be set to be equal to or greater than 1 which make the 
clock frequency equal to F(CLKIN)/8. 

Examples of serial port register setups for the above case are shown below. 
(Assume two TMS320C3xs have the same system clock.) 



32 bits, fixed data rate, burst mode, 
FSX (output), CLKX (output) = F(CLKIN)/8 
CLKR (input), hands-hake mode, transmit 
and receive interrupt is enabled. 



Setup 1 : 




Global control 


= 0EBC0064h 


Transmit port control 


= 0111h 


Receive port control 


= 0111h 


S_port timer control 


= OFh 


S_port timer count 


= Oh 


S_port timer period 


> 01h 


Setup 2: 




Global control 


= 0C000364h 


Transmit port control 


= 0111h 


Receive port control 


= 0111h 


S_port timer control 


= OFh 


S_port timer count 


= Oh 


S_port timer period 


> 01h 



8 bits, variable data rete, burst mode, 
FSX (output), CLKX (output) =f(CLKIN)/24 
CLKR (input), handshake mode, transmit 
and receive interrupt is disabled. 



Since the data has a leading 1 and the acknowledge signal is a in the hand- 
shake mode, the TMS320C3x serial port can distinguish the signals between 
the data and the acknowledge signal. Therefore, even if the TMS320C3x serial 
port receives the data before the acknowledge signal, the data will not be misin- 
terpreted as the acknowledge signal and be lost. In addition, the acknowledge 
signal is not generated until the data is read from the data receive register, 
DRR. Therefore, the TMS320C3x will not transmit the data and the acknowl- 
edge signal simultaneously. 
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8.2. 13.2 Serial AlC Interface Example 



The TLC320C4x analog interface chips (AlC) from Texas Instruments offer a 
zero glue-logic interface to the TMS320C3x family of DSPs. The interface is 
shown in Figure 8-30. This interface is used as an example of the TMS320C3X 
serial port configuration and operation. 



Figure 8-30. TMS320C3x Zero Glue-Logic Interface to TLC3204x Example 
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The TMS320C3x resets the AlC through the external pin XFO. It also generates 
the master clock for the AlC through the timer output pin, TCLKO. (Precise 
selection of a sample rate may require the use of an external oscillator rather 
than the TCLKO outputto drive the AlC MCLK input.) In turn, the AlC generates 
the CLKRO and CLKXO shift clocks as well as the FSRO and FSXO frame syn- 
chronization signals. 

A typical use of the AlC requires an 8 kHz sample rate of the analog signal. If 
the clock input frequency to the TMS320C3x device is 30 MHz, the following 
values should be loaded into the serial port and timer registers. 



Serial Port: 

Port global control register: 

FSX/DX/CLKX port control register 

FSR/DR/CLKR port control register 

Timer: 

Timer global control register 

Timer period register 



0E970300h 
00000111 h 
00000111 h 

000002C1h 
00000001 h 



8.2. 13.3 Serial AID and D/A Interface Example 



The DSP201/2 and DSP101/2 family of D/As and A/Ds from Burr Brown also 
offer a zero glue-logic interface to the TMS320C3x family of DSPs. The inter- 
face is shown in Figure 8-31. This interface is used as an example of the 
TMS320C3x serial port configuration and operation. 
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Figure 8-3 1. TMS320C3x Zero Glue-Logic Interface to Burr Brown AID and D/A Example 
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The DSP1 02 A/D is interfaced to the TMS320C3x serial port receive side; the 
DSP202 D/A is interfaced to the transmit side. The A/Ds and D/As are hard 
wired to run in cascade mode. In this mode, when the TMS320C3x initiates a 
convert command to the A/D, via the TCLKO pin, both analog inputs are con- 
verted into two 1 6-bit words which are concatenated to form one 32-bit word. 
The A/D signals the TMS320C3x, via the A/D's SYNC signal (connected to the 
TMS320C3x FSRO pin), that serial data is to be transmitted. The 32-bit word 
is then serially transmitted, MSB first, out the SOUTA serial pin of the DSP1 02 
to the DRO pin of the TMS320C3x serial port. The TMS320C3x is programmed 
to drive the analog interface bit clock from CLKXO pin of the TMS320C3x. The 
bit clock drives both the A/D's and D/A's XCLK input. The TMS320C3x transmit 
clock also acts as the input clock on the receive side of the TMS320C3x serial 
port. Since the receive clock is synchronous to the internal clock of the 
TMS320C3x, the receive clock can run at full speed (that is, f(H1)/2). 

Similarly, upon receiving a convert command, the pipelined D/A converts the 
last word received from the TMS320C3x and signals the TMS320C3X, via the 
SYNC signal (connected to the TMS320C3x FSXO pin), to begin transmitting 
a 32-bit word representing the two channels of data to be converted. The data, 
transmitted from the TMS320C3x DXO pin is input to both the SINA and SINB 
inputs of the D/A as shown in the figure. 
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The TMS320C3x is set up to transfer bits at the maximum rate of about 8 Mbps 
with a dual channel sample rate of about 44.1 kHz. This standard mode, fixed 
data rate signaling interface is configured by setting the following registers as 
described below: 

Serial Port: 

Port global control register: 0EBC0040h 

FSX/DX/CLKX port control register 000001 1 1 h 

FSR/DR/CLKR port control register 000001 1 1 h 

Receive/transmit timer control register OOOOOOOFh 

Timer: 

Timer global control register 000002C1 h 

Timer period register 0000005Ah 



8.2.14 Serial Port Initialization/Reconfiguration 



The serial ports are controlled through memory-mapped registers located on 
the dedicated peripheral bus. A general procedure for initializing and/or recon- 
figuring the serial ports follows: 

1 ) Halt the serial port by clearing the XRESET and/or RRESET bits of the ser- 
ial-port global-control register. This can be accomplished by writing a to 
the serial-p ort global-control register. Note that the serial ports are halted 
on RESET. 

2) Configure the serial port via the serial-port global-control register (with 
XRESET = RRESET = 0), FSX/DX/CLKX and FSR/DR/CLKR port control 
r egisters , as well as the receive/transmit timer control register (with XHLD 
= RHLD = 0, receive/transmit timer counter register and the receive/trans- 
mit timer period register, if necessary. Refer to subsection 8.2.13, 
"TMS320C3x Serial Port Interface Examples." 

3) Start the serial port by setting the X RESET an d RRES ET bits of the serial- 
port global-control register and the XHLD and RHLD bits of the serial port 
receive/transmit timer control register, if necessary. 
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8.3 DMA Controller 



The TMS320C3x has an on-chip Direct Memory Access (DMA) controller that 
reduces the need for the CPU to perform input/output functions. The DMA con- 
troller can perform input/output operations without interfering with the opera- 
tion of the CPU. Therefore, it is possible to interface the TMS320C3x to slow 
external memories and peripherals (A/Ds, serial ports, etc.) without reducing 
the computational throughput of the CPU. The result is improved system per- 
formance and decreased system cost. 

A DMA transfer consists of two operations: a read from a memory location and 
a write to a memory location. The DMA controller can read from and write to 
any location in the TMS320C3X memory map. This includes all 
memory-mapped peripherals. The operation of the DMA is controlled with the 
following set of memory-mapped registers: 

Ql DMA global-control register 

□i DMA source address register 

Qi DMA destination address register 

Q DMA transfer counter register 

These registers, their memory-mapped addresses, and their functions are 
shown in Figure 8-32. Each of these DMA registers is discussed in the 
succeeding subsections. 
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Figure 8-32, Memory-Mapped Locations for a DMA Channel 
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8.3.1 DMA Global-Control Register 



The global-control register controls the state in which the DMA controller oper- 
ates. This register also indicates the status of the DMA, which changes every 
cycle. Source and destination addresses can be incremented, decremented, 
or SYNChronized using specified global-control register bits. At system reset, 
all bits in the DMA control register are set to 0. Table 8-7 lists the register bits, 
names, and functions. Figure 8-33 shows the bit configuration of the global- 
control register. 
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Figure 8-33. DMA Global-Control Register 
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NOTE: xx = Reserved bit, read as 0. 
R = read, W = write. 

Table 8-7. DMA Global-Control Register Bits 



Bit 


Name 


Reset Value 


Function 


— 1 


START 


0-0 


These bits control the state in which the DMA starts and stops. The 
DMA may be stopped without any loss of data (see Table 8-8). 


2 — 3 


STAT 


0-0 


These bits indicate the status of the DMA and change every cycle 
(see Table 8-9). 


4 


INCSRC 





If INCSRC = 1 , the source address is incremented after every read. 


5 


DECSRC 





If DECSRC = 1 , the source address is decremented after every 
read. If INCSRC = DECSRC, the source address is not modified af- 
ter a read. 


6 


INCDST 





If INCDST = 1 , the destination address is incremented after every 
write. 


7 


DECDST 





If DECDST = 1 , the destination address is decremented after every 
write. If INCDST = DECDST, the destination address is not modified 
after a write. 


9 — 8 


SYNC 


0-0 


The SYNC bits determine the timing synchronization between the 
events initiating the source and the destination transfers. The inter- 
pretation of the SYNC bits is shown in Table 8-1 0. 


10 


TC 





The TC bit affects the operation of the transfer counter. If TC = 0, 
transfers are not terminated when the transfer counter becomes 
zero. If TC = 1 , transfers are terminated when the transfer counter 
becomes zero. 


11 


TCINT 





If TCINT = 1, the DMA interrupt is set when the transfer counter 
makes a transition to zero. If TCINT = 0, the DMA interrupt is not set 
when the transfer counter makes a transition to zero. 


31—12 


Reserved 


0-0 


Read as zero. 



8-40 



Peripherals 



DMA Controller 



Table 8-8. START Bits and Operation of the DMA (Bits 0-1) 



START 


Function 


00 


DMA read or write cycles in progress will be completed; any data read will be ig- 
nored. Any pending read or write will be cancelled. The DMA is reset so that when 
it starts, a new transaction begins; i.e., a read is performed. (Reset value) 


01 


If a read or write has begun, it is completed before it stops: for example, in the 
middle or at the end of a DMA transfer. If a read or write has not begun, no read 
or write is started. 


1 


If a DMA transfer has begun, the entire transfer is completed (including both read 
and write operations) before stopping. If a transfer has not begun, none is started. 


1 1 


DMA starts from reset or restarts from the previous state. 



Table 8-9. STAT Bits and Status of the DMA (Bits 2-3) 



STAT 


Function 


00 


DMA is being held between DMA transfer (between a write and read). This is the 
value at reset. (Reset value) 


01 


DMA is being held in the middle of a DMA transfer, i.e., between a read and a write. 


1 


Reserved. 


1 1 


DMA busy; i.e., DMA is performing a read or write. 



Table 8-10. SYNC Bits and Synchronization of the DMA (Bits 8-9) 



SYNC 



00 



01 



1 



1 1 



Function 



No synchronization. Enabled interrupts are ignored. (Reset value) 



Source synchronization. A read is performed when an enabled interrupt occurs. 



Destination synchronization. A write is performed when an enabled interrupt oc- 
curs. 



Source and destination synchronization. A read is performed when an enabled in- 
terrupt occurs. A write is then performed when the next enabled interrupt occurs. 
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8.3.2 Destination and Source Address Registers 



The DMA destination and source address registers are 24-bit registers whose 
contents specify destination and source addresses. As specified by control bits 
DECSRC, INCSRC, DECDST, and INCDSTof the DMA global control register, 
these registers are incremented and decremented at the end of the corre- 
sponding memory access, that is, the source register for a read, the destination 
register for a write. On system reset, is written to these registers. 



8.3.3 Transfer Counter Register 



The transfer counter register is a 24-bit register, controlled by a 24-bit counter 
that counts down. The counter decrements at the beginning of a DMA memory 
write. In this way, it can be used to control the size of a block of data transferred. 
The transfer counter register is set to at system reset. When TCINT bit of 
DMA global control register is set, the transfer counter register will cause a 
DMA interrupt flag to be set upon count down to zero. 



8.3.4 CPU/DMA Interrupt Enable Register 



The CPU/DMA interrupt enable register (IE) is a 32-bit register located in the 
CPU register file. The CPU interrupt enable bits are in locations 10 — 1 . The 
DMA interrupt enable bits are in locations 26 — 1 6. A 1 in a CPU/DMA interrupt 
enable register bit enables the corresponding interrupt. A disables the corre- 
sponding interrupt. At reset, is written to this register. 

Table 8-11 lists the bits, names, and functions of the CPU/DMA interrupt en- 
able register. Figure 8-34 shows the IE register. The priority and decoding 
schemes of CPU and DMA interrupts are identical. Note that when the DMA 
receives an interrupt, this interrupt is acted upon according to the SYNC field 
of the DMA control register. Also note that an interrupt may affect the DMA but 
not the CPU and may affect the CPU but not the DMA. Refer to Chapter 6 and 
to subsection 3.1.8. 
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Figure 8-34. CPU/DMA Interrupt Enable Register 
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NOTE: xx = Reserved bit, read as 0. 
R = read, W = write. 



Table 8-11. CPU/DMA Interrupt Enable Register Bits 



Bit 


Name 


Function 





EINTO 


Enable external interrupt (CPU) 


1 


EINT1 


Enable external interrupt 1 (CPU) 


2 


EINT2 


Enable external interrupt 2 (CPU) 


3 


EINT3 


Enable external interrupt 3 (CPU) 


4 


EXINTO 


Enable serial-port transmit interrupt (CPU) 


5 


ERINTO 


Enable serial-port receive interrupt (CPU) 


6 


EXINT1 


Enable serial-port 1 transmit interrupt (CPU) 


7 


ER1NT1 


Enable serial-port 1 receive interrupt (CPU) 


8 


ETINTO 


Enable timer interrupt (CPU) 


9 


ETINT1 


Enable timer 1 interrupt (CPU) 


10 


EDINT 


Enable DMA controller interrupt (CPU) 


15—11 


Reserved 


Read as 


16 


EINTO 


Enable external interrupt (DMA) 


17 


EINT1 


Enable external interrupt 1 (DMA) 


18 


EINT2 


Enable external interrupt 2 (DMA) 


19 


EINT3 


Enable external interrupt 3 (DMA) 


20 


EXINTO 


Enable serial-port transmit interrupt (DMA) 


21 


ERINTO 


Enable serial-port receive interrupt (DMA) 


22 


EXINT1 


Enable serial-port 1 transmit interrupt (DMA) 


23 


ERINT1 


Enable serial-port 1 receive interrupt (DMA) 


24 


ETINTO 


Enable timer interrupt (DMA) 


25 


ETINT1 


Enable timer 1 interrupt (DMA) 


26 


EDINT 


Enable DMA controller interrupt (DMA) 


31—27 


Reserved 


Read as 
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8.3.5 DMA Memory Transfer Operation 

Each DMA memory transfer consists of two parts: 

1 ) Read data from the address specified by the DMA source register. 

2) Write data that has been read to the address specified by the DMA destina- 
tion register. 

A transfer is complete only when the read and write are complete. A transfer 
may be stopped by setting the START bits to the desired value. When the DMA 
is restarted (START =1 1), it completes any pending transfer. 

At the end of a DMA read, the source address is modified as specified by the 
SRCINC and SRCDEC bits of the DMA global control register. At the end of 
a DMA write, the destination address is modified as specified by the DSTINC 
and DSTDEC bits of the DMA global control register. At the end of every DMA 
write, the DMA transfer counter is decremented. 

DMA on-chip reads and writes (reads and writes from on-chip memory and pe- 
ripherals) are single cycle. DMA off-chip reads are two cycles. The first cycle 
is the external read, and the second cycle loads the DMA register. The external 
read cycle is identical to a CPU read cycle. DMA off-chip writes are identical 
to CPU off-chip writes. If the DMA has been started and is transferring data 
over either external bus, the bus control register associated with that bus 
should not be modified. If the bus control register (see Chapter 7) needs to be 
modified, the DMA should be stopped, modification made, and then the DMA 
restarted. Failure to do so may produce an unexpected zero-wait-state bus ac- 
cess. 

Through the 24-bit source and destination registers, the DMA is capable of ac- 
cessing any memory-mapped location in the TMS320C3x memory map. 
Figure 8-35 through 8-34 show the number of cycles a DMA transfer requires, 
depending upon whether the source and destination are on-chip memory and 
peripherals, the external port, or the I/O port. Trepresents the number of trans- 
fers to be performed, C r represents the number of wait-states for the source 
read, and C w represents the number of wait-states for the destination write. 
Each entry in the table represents the total cycles required to do the T trans- 
fers, assuming that there are no pipeline conflicts. 

Accompanying each table is a figure illustrating the timing of the DMA transfer. 
|R| and |W| represent single-cycle reads and writes, respectively. |R.R| and 
|W.W| represent multicycle reads and writes. |Cr| and |Cw| show the number 
of wait cycles for a read and write. 
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Figure 8-35. Timing and Number of Cycles for DMA Transfers When Destination Is On-Chip 
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Figure 8-36. DMA Timing When Destination Is a Primary Bus 
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Figure 8-37. DMA Timing When Destination Is an Expansion Bus 
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Table 8-1 2 shows the maximum DMA transfer rates, assuming that there are 
no wait states (C r = Cw = 0). Table 8-13 shows the maximum DMA transfer 
rates, assuming there is one wait state for the read (C r = 1 ) and no wait states 
for the write (C^ = 0). Table 8-14 shows the maximum DMA transfer rates, 
assuming there is one wait state for the read (C r = 1 ) and one wait state for the 
write (£^=1). 

In each table, the time for the complete transfer (the read and the write) is con- 
sidered. Since one bus access is required for the read and another forthe write, 
internal bus transfer rates will be twice the DMA transfer rate. It is also as- 
sumed that no conflicts with the CPU exist. 



Table 8-12. Maximum DMA Transfer Rates When C r = C w = 







Source 




Destination 






Internal 


Primary 


Expansion 




Internal 


33.3 Mbytes/sec 


33.3 Mbytes/sec 


33.3 Mbytes/sec 




Primary 


22.2 Mbytes/sec 


1 6.7 Mbytes/sec 


22.2 Mbytes/sec 




Expansion 


22.2 Mbytes/sec 


22.2 Mbytes/sec 


16.7 Mbytes/sec 


Table 8-13. 


Ma, 


ximum DMA Transfer Rates When C r =1,C w = 











Source 




Destination 






Internal 


Primary 


Expansion 




Internal 


33.3 Mbytes/sec 


33.3 Mbytes/sec 


33.3 Mbytes/sec 




Primary 


16.7 Mbytes/sec 


1 3.3 Mbytes/sec 


1 6.7 Mbytes/sec 




Expansion 


1 6.7 Mbytes/sec 


16.7 Mbytes/sec 


1 3.3 Mbytes/sec 


Table 8-14. 


Ma 


ximum DMA Transfer Rates When C r =1,C w = 


/ 








Source 




Destination 






Internal 


Primary 


Expansion 




Internal 


33.3 Mbytes/sec 


22.2 Mbytes/sec 


22.2 Mbytes/sec 




Primary 


1 6.7 Mbytes/sec 


11.1 Mbytes/sec 


1 6.7 Mbytes/sec 




Expansion 


1 6.7 Mbytes/sec 


16.7 Mbytes/sec 


11.1 Mbytes/sec 



8-48 



Peripherals 



DMA Controller 



8.3.6 Synchronization of DMA Channels 



A DMA channel may be synchronized through the use of interrupts. Refer to 
Table 8-1 for the relationship between the SYNC bits of the DMA global con- 
trol register and the synchronization performed. This section describes the fol- 
lowing four synchronization mechanisms: 

Ql No synchronization (SYNC =0 0) 

Qi Source synchronization (SYNC = 1) 

□ Destination synchronization (SYNC = 1 0) 

Qi Source and destination synchronization (SYNC = 1 1) 

No Synchronization 

When SYNC = 0, no synchronization is performed. The DMA performs reads 
and writes whenever there are no conflicts. All interrupts are ignored and, 
therefore, are considered to be globally disabled. However, no bits in the DMA 
interrupt enable register are changed. Figure 8-38 shows the synchronization 
mechanism when SYNC = 0. 



Figure 8-38. No DMA Synchronization 

I Start 



I 



| Disable DMA Interrupts Globally 



I DMA Channel Performs a Read 

I 

I DMA Channel Performs a Write 

^x 



| Go to Start 



Source Synchronization 



When SYNC = 1 , the DMA is synchronized to the source (see Figure 8-39). 
A read will not be performed until an interrupt is received by the DMA. Then, 
all DMA interrupts are disabled globally. However, no bits in the DMA interrupt 
enable register are changed. 
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Figure 8-39. DMA Source Synchronization 
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Destination Synchronization 

When SYNC = 1 0, the DMA is synchronized to the destination. First, all inter- 
rupts are ignored until the read is complete. Though the DMA interrupts are be 
considered to be globally disabled, no bits in the DMA interrupt enable register 
are changed. A write will not be performed until an interrupt is received by the 
DMA. Figure 8-40 shows the synchronization mechanism when SYNC = 1 0. 



Figure 8-40. DMA Destination Synchronization 
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Source and Destination Synchronization 

When SYNC = 1 1 , the DMA is synchronized to both the source and destina- 
tion. A read is performed when an interrupt is received. A write is performed 
on the following interrupt. Source and destination synchronization when SYNC 
= 1 1 is shown in Figure 8-41 . 
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Figure 8-4 1. DMA Source and Destination Synchronization 
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8.3.7 DMA Interrupts 



A DMA interrupt to the CPU may be generated whenever the transfer count 
reaches zero, indicating that the last transfer has taken place. The TCINT bit 
in the DMA global control register determines whether the interrupt will be gen- 
erated. If TCINT = 1 , the DMA interrupt is generated. If TCINT = 0, the DMA 
interrupt is not generated. If the DMA interrupt is generated, the EDINT bit, bit 
1 in the interrupt enable register, must also be set to enable the CPU to be 
interrupted by the DMA. 

A second bit in the DMA global control register, the TC bit, is also generally as- 
sociated with the state of the TCINT bit and the interrupt operation. The TC bit 
determines if transfers are terminated when the transfer counter becomes zero 
or if they are allowed to continue. If TC = 1 , transfers are terminated when the 
transfer count becomes zero. If TC = 0, transfers are not terminated when the 
transfer count becomes zero. 

In general, if TCINT is then TC should also be set to 0. Otherwise, the DMA 
transfer will terminate and the CPU will not be notified. If TCINT is 1 then in 
most cases TC should also be 1 . In this case, the CPU will be notified when 
the transfer completes and the DMA will be halted and ready to start a new 
transfer. 
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8.3.8 DMA Setup and Use Examples 



Transfer a 256-word block of data from off-chip memory to on-chip memory 
and generate an interrupt on completion. The order of memory is to be main- 
tained. 

DMA source address: 800000h 

DMA destination address: 809800h 

DMA transfer counter: 000001 OOh 

DMA global control: 00000C53h 

CPU/DMA interrupt enable (IE): 00000400h 

Transfer a 128-word block of data from on-chip memory to off-chip memory 
and generate an interrupt on completion. The order of memory is to be in- 
verted, i.e., the highest addressed member of the block is to become the lowest 
addressed member. 

DMA source address: 809800h 

DMA destination address: 800000h 

DMA transfer counter: 00000080h 

DMA global control: 00000C93h 

CPU/DMA interrupt enable (IE): 00000400h 

Transfer a 200-word block of data from the serial port receive register to on- 
chip memory and generate an interrupt on completion. The transfer is to be 
synchronized with the serial port receive interrupt. 

DMA source address: 80804Ch 

DMA destination address: 809C00h 

DMA transfer counter: 000000C8h 

DMA global control: 00000D43h 

CPU/DMA interrupt enable (IE): 00200400h 

Transfer a 200-word block of data from off-chip memory to the serial port 
transmit register and generate an interrupt on completion. The transfer is to be 
synchronized with the serial port transmit interrupt. 

DMA source address: 809C00h 

DMA destination address: 808048h 

DMA transfer counter: 000000C8h 

DMA global control: 00000E13h 

CPU/DMA interrupt enable (IE): 00400400h 
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Transfer data continuously between the serial port receive register and the 
serial port transmit register to create a digital loop back. The transfer is to be 
synchronized with the serial port receive and transmit interrupts. 

DMA source address: 80804Ch 

DMA destination address: 808048h 

DMA transfer counter: OOOOOOOOh 

DMA global control: 00000303h 

CPU/DMA interrupt enable (IE): 00600000h 



8.3.9 DMA Initialization/Reconfiguration 



The DMA is controlled through memory-mapped registers located on the dedi- 
cated peripheral bus. A general procedure for initializing and/or reconfiguring 
the DMA follows: 

1 ) Halt the DMA by clearing the START bits of the DMA global-control regis- 
ter. This can be accomplished by writi ng a to the DMA global-control reg- 
ister. Note that the DMA is halted on RESET. 

2) Configure the DMA via the DMA global-control register (with START = 00), 
as well as the DMA source, destination, and transfer-counter registers, if 
necessary. Refer to subsection 8.3.8, "DMA Setup and Use Examples." 

3) Start the DMA by setting the START bits of the DMA global-control register 
as necessary. 
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Chapter 9 



Two characteristics of the TMS320C3x that contribute to its high performance 
are pipelining and concurrent I/O and CPU operation. 

Five functional units control TMS320C3x operation: fetch, decode, read, ex- 
ecute, and DMA. Pipelining is the overlapping or parallel operations of the 
fetch, decode, read, and execute levels of a basic instruction. 

By performing input/output operations, the DMA controller reduces the need 
for the CPU to do so, thereby decreasing pipeline interference and enhancing 
the CPU's computational throughput. 

Major topics discussed in this chapter are as follows: 

Q Pipeline Structure (Section 9.1 on page 9-2) 

Q Pipeline Conflicts (Section 9.2 on page 9-4) 
□ Branch conflicts 
Q Register conflicts 
E Memory conflicts 

Q Resolving Memory Conflicts (Section 9.4 on page 9-1 9) 

Qi Clocking of Memory Accesses (Section 9.5 on page 9-21 ) 
n Program fetches 
a Data loads and stores 
Q DMA accesses 



9-1 



Pipeline Structure 



9.1 Pipeline Structure 



The five major units of the TMS320C3x pipeline structure and their functions 
are as follows: 



Fetch Unit (F) 
Decode Unit (D) 

Read Unit (R) 
Execute Unit (E) 



Fetches the instruction words from memory and 
updates the program counter (PC). 

Decodes the instruction word and performs ad- 
dress generation. Also controls any modification of 
the auxiliary registers and the stack pointer. 

If required, reads the operands from memory. 

If required, reads the operands from the register 
file, performs the necessary operation, and writes 
results to the register file. If required, results of pre- 
vious operations are written to memory. 

DMA Channel (DMA) Reads and writes memory. 

A basic instruction has four levels: fetch, decode, read, and execute. 
Figure 9-1 illustrates these four levels of the pipeline structure. The levels are 
indexed according to instruction and execution cycle. The perfect overlap in 
the pipeline, where all four units operate in parallel, occurs at cycle (m). Those 
levels about to be executed are at m +1 , and those just executed are at m-1 . 
The TMS320C3x pipeline control allows a high-speed execution rate of one ex- 
ecution per cycle. It also manages pipeline conflicts so that they are transpar- 
ent to the user. You do not need to take any special precautions to guarantee 
correct operation. 
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Figure 9-1. TMS320C3x Pipeline Structure 



Pipeline Structure 



CYCLE 

m-3 

m-2 

m-1 

m 

m+1 

m+2 

m+3 



w <- 

X 
Y 
Z 



Perfect overlap 



Notes: 1) W, X, Y, and Z represent instructions. 

2) F, D, R, E = fetch, decode, read, and execute, respectively. 

Priorities from highest to lowest have been assigned to each of the functional 
units as follows: 

Q Execute (highest) 

Q Read 

Qi Decode 

Q Fetch 

□ DMA (lowest). 

When the processing of an instruction is ready to pass to the next higher pipe- 
line level, but that level is not ready to accept a new input, a pipeline conflict 
occurs. In this case, the lower priority unit waits until the higher priority unit 
completes its currently executing function. 

Despite the DMA controllers low priority, conflicts with the CPU can be mini- 
mized or even eliminated by suitable data structuring because the DMA con- 
troller has its own data and address buses. 
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9.2 Pipeline Conflicts 



The pipeline conflicts of the TMS320C3x can be grouped into the following 
main categories: 

Branch Conflicts Involve most of those instructions or operations 

that read and/or modify the PC. 

Register Conflicts Involve delays that can occur when reading from or 

writing to registers that are used for address gen- 
eration. 

Memory Conflicts Occur when the internal units of the TMS320C3x 

compete for memory resources. 

Each of these three types is discussed in the following sections. Examples are 
included. Note in these examples, when data is refetched or an operation is 
repeated, the symbol representing the stage of the pipeline is appended with 
a number. For example, if afetch is performed again, the instruction mnemonic 
is repeated. When an access is detained multiple cycles because of not ready, 
the symbols RDY and RDY are used to indicate not ready and ready, respec- 
tively. 



9.2.1 Branch Conflicts 



The first class of pipeline conflicts occurs with standard (non-delayed) 
branches, i.e., BR, Bcond, DBcond, CALL, IDLE, RPTB, RPTS, RETIcond, 
RETScond, interrupts, and reset. Conflicts arise with these instructions and 
operations because during their execution, the pipeline is used only for the 
completion of the operation; other information fetched into the pipeline is dis- 
carded or refetched, or the pipeline is inactive. This is referred to as flushing 
the pipeline. Flushing the pipeline is necessary in these cases to guarantee 
that portions of succeeding instructions do not inadvertently get partially ex- 
ecuted. TRAPcond and CALLcond are classified differently from the other 
types of branches and are considered later. 

Example 9-1 shows the code and pipeline operation for a standard branch. 
Note that one dummy fetch is performed (MPYF instruction), and then after the 
branch address is available, a new fetch (OR instruction) is performed. This 
dummy fetch affects the cache. 
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Example 9-1. Standard Branch 



BR 

MPYF 

ADD 

SUBF 

AND 



THREE 



Unconditional branch 
Not executed 
Not executed 
Not executed 
Not executed 



THREE ■ OR 
STI 



; Fetched after BR is fetched 



PIPELINE OPERATION 



PC 

n 

n+1 

n+1 

n+1 

THREE 



/ 



F 

BR 
MPYF 



(nop) 
(nop) 



OR 
STI 



THREE -» PC 



Fetch held for 
new PC value 



BR 
(nop) 
(nop) 
(nop) 

OR 



BR 
(nop) 
(nop) 
(nop) 



BR 
(nop) 
(nop) 



RPTS and RPTB both flush the pipeline, allowing the RS, RE, and RC registers 
to be loaded at the proper time relative to the flow of the pipeline. If these regis- 
ters are loaded without the use of RPTS or RPTB, no flushing of the pipeline 
occurs. If none of the repeat modes are being used, RS, RE, and RC may be 
used as general-purpose 32-bit registers without any pipeline conflicts occur- 
ring. In cases such as the nesting of RPTB due to nested interrupts, it may be 
necessary to load and store these registers directly while using the repeat 
modes. Since up to four instructions can be fetched before entering the repeat 
mode, loads should be followed by a branch to flush the pipeline. If the RC is 
changing when an instruction is loading it, the direct load takes priority over the 
modification made by the repeat mode logic. 
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Delayed branches are implemented to guarantee the fetching of the next three 
instructions. The delayed branches include BRD, BcondD, and DBconoD. 
Example 9-2 shows the code and pipeline operation for a delayed branch. 



Example 9-2. Delayed Branch 



BRD THREE ; Unconditional delayed branch 
MPYF ; Executed 

ADD ; Executed 

SUBF / Executed 

AND ; Not executed 



THREE MPYF 



; Fetched after SUBF is fetched 



PIPELINE OPERATION 



PC 

n 

n+1 

n+2 

n+3 



BRD 



MPYF BRD 



ADDF MPYF BRD 



SUBF ADDF MPYF BRD 



No execute delay 



^JHREE 



THREE -» PC 



9.2.2 Register Conflicts 



MPYF SUBF ADDF MPYF 



Register conflicts involve the reading or writing of registers used for addressing 
purposes. These conflicts occur when the pertinent register is not ready to be 
used. Some conditions under which register conflicts can be avoided are dis- 
cussed in Section 9.3. 

The registers compose the following three functional groups: 
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Pipeline Conflicts 



Group 1 Auxiliary registers (ARO — AR7), index registers (IRO, IR1), and 
block size register (BK) 

Group 2 Data-page pointer (DP) 

Group 3 System stack pointer (SP) 

If an instruction writes to one of these three groups, the decode unit cannot use 
any register within that particular group until the write is complete, i.e., instruc- 
tion execution is completed. In Example 9-3, an auxiliary register is loaded, 
and a different auxiliary register is used on the next instruction. Since the de- 
code stage needs the result of the write to the auxiliary register, the decode of 
this second instruction is delayed two cycles. Every time the decode is delayed, 
a refetch of the program word is performed; i.e., the ADDF is fetched three 
times. Since these are actual refetches, they can cause not only conflicts with 
the DMA controller but also cache hits and misses. 



Example 9-3. Write to an AR Followed by an AR for Address Generation 



NEXT 



LDI 
MPYF 
ADDF 
FLOAT 



7 , AR1 ; 7 -> AR1 

*AR2,R0 ; Decode delayed 2 cycles 



PIPELINE OPERATION 



PC 

n 

n+1 

n+2 

n+2 

n+2 

n+3 



F 


' D 


LDI 


- 


MPYF 


LDI 


ADDF 


MPYF 


ADDF 


MPYF 



LDI 

(nop) LDI 7,AR1 
ADDF MPYF (nop) (nop) 
FLOAT ADDF MPYF (nop) 



Decode/address 
generation held 
for a new AR value 

AR1 loaded 



The case for reads of these groups is similar to the case for writes. If an 
instruction must read a member of one of these groups, the use of that particu- 
lar group by the decode for the following instruction is delayed until the read 
is complete. The registers are read at the start of the execute cycle and there- 
fore require only a one-cycle delay of the following decode. For four registers 
(IRO, IR1 , BK, or DP) no delay is incurred. In all other cases, including the SP 
the delay occurs. 
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In Example 9-4, two auxiliary registers are added together with the result go- 
ing to an extended-precision register. The next instruction uses a different aux- 
iliary register as an address register. 



Example 9-4. A ReadofARs Followed by ARs for Address Generation 



ADDI AR0,AR1,R1 ; ARO + AR1 -> Rl 
NEXT MPYF *++AR2,R0 /Decode delaved 1 cycle 

ADDF 
FLOAT 



PIPELINE OPERATION 



PC 


I F 


D 


n 


ADDI 


- 


n+1 


MPYF 


ADDI 


n+2 


ADDF • 


MPYF 


n+2 


ADDF 


MPYF 


n+3 


FLOAT 


ADDF 




ADDI 



(nop) ADDI AR0,AR1,R0 



Decode/address 
generation held 
until AR is read 

ARs read 



The DBR (decrement and branch) instruction's use of auxiliary registers for 
loop counters is treated the same as if the use were for addressing. Therefore, 
the operation shown in the two previous examples can also occur for this in- 
struction. 
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9.2.3 Memory Conflicts 



Memory conflicts can occur when the memory bandwidth of a physical memory 
space is exceeded. For example, RAM blocks and 1 and the ROM block can 
support only two accesses every cycle. The external interface can support only 
one access per cycle. Some conditions under which memory conflicts can be 
avoided are discussed in Section 9.4. 

Memory pipeline conflicts consist of the following four types: 

Program Wait A program fetch is prevented from begin- 

ning. 

Program Fetch Incomplete A program fetch has begun but is not yet 

complete. 

Execute Only An instruction sequence requires three CPU 

data accesses in a single cycle. 

Hold Everything A primary or expansion bus operation must 

complete before another one can proceed. 

These four types of memory conflicts are illustrated in exam pies and discussed 
in the paragraphs that follow. 

Program Wait 

Two conditions can prevent the program fetch from beginning: 

□ The start of a CPU data access when 

■ Two CPU data accesses are made to an internal RAM or ROM block, 
and a program fetch from the same block is necessary. 

■ One of the external ports is starting a CPU data access, and a program 
fetch from the same port is necessary. 

Qi A multicycle CPU data access or DMA data access over the external bus 
is needed. 
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Example 9-5 illustrates a program wait until a CPU data access completes. In 
this case, *AR0 and *AR1 are both pointing to data in RAM block 0, and the 
MPYF instruction will be fetched from RAM block 0. This results in the conflict 
shown. Since no more than two accesses can be made to RAM block in a 
single cycle, the program fetch cannot begin and must wait until the CPU data 
accesses are complete. 



Example 9-5. Program Wait Until CPU Data Access Completes 







ADDF3 

FIX 

MPYF 

ADDF3 

NEGB 


*AR0,*AR1,R0 








PIPELINE OPERATION 






PC 


| F | D 


| R | B 






n 


ADDF3 


- 




Fetch held until 
ARs are read 


n+1 


FIX ADDF3 


— ^ ' 




- ARs read 


n+2 


(WAIT) 


FIX 


ADDF3 < ="""" ~~ 






n+2 


MPYF 


(nop) 


FIX ADDF3 


*AR0,AR1,R0 




n+3 


ADDF3 MPYF 


(nop) FIX 






n+4 


NEGB 


ADDF3 


MPYF (nop) 







Example 9-6 shows a program wait due to a multicycle data-data access or 
a multicycle DMA access. The ADDF, MPYF, and SUBF are fetched from some 
portion in memory other than the external port the DMA requires. The DMA be- 
gins a multicycle access. The program fetch corresponding to the CALL is 
made to the same external port the DMA is using. 

Either of two cases may produce this situation: 

Qi Crossing one of two memory boundaries 

■ from 7F FFFFh to 80 OOOOh, or 

■ from 80 9FFFh to 80 AOOOh. 

Q Code is executed that has been cached, and the instruction prior to the 
addf is one of the following (conditional or unconditional): 

■ a delayed branch instruction, or 

■ a delayed decrement and branch instruction 

Even though the DMA has the lowest priority, multicycle access cannot be 
aborted. The program fetch must therefore wait until the DMA access com- 
pletes. 
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Example 9-6. Program Wait Due to Multicycle Access 



PC 

n 

n+1 

n+2 

n+3 

n+3 

n+4 



PIPELINE OPERATION 

F | D | R | E 

ADDF - 
MPYF ADDF 

SUBF MPYF ADDF 

(WAIT) SUBF MPYF ADDF 

CALL (nop) SUBF MPYF 

CALL (nop) SUBF 



2-cycle DMA access 



Program Fetch Incomplete 

A program fetch incomplete occurs when a program fetch takes more than one 
cycle to complete due to wait states. In Example 9-7, the MPYF and ADDF are 
fetched from memory that supports single-cycle accesses. The SUBF is 
fetched from memory requiring one wait state. One example that demonstrates 
this conflict is a fetch across a bank boundary on the primary port. See Sec- 
tion 7.4. 



Example 9-7. Multicycle Program Memory Fetches 





PIPELINE OPERATION 


PC 


I F 


I D 


| R | E 


n 


MPYF 


- 


- 


n+1 


ADDF 
SUBF 


MPYF 
ADDF 


- 


n+2 RDY 


MPYF - 


n+2 RDY 


SUBF 


(nop) 


ADDF MPY 


n+3 


ADDI 


SUBF 


(nop) ADD 



1 wait state required 

1 
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Execute Only 

The Execute Only type of memory pipeline conflict occurs when a sequence 
of instructions requires three CPU data accesses in a single cycle or when per- 
forming an interlocked load. There are three cases in which this occurs: 

□ An instruction performs a store and is followed by an instruction that does 
two memory reads. 

Qi An instruction performs two stores and is followed by an instruction that 
performs at least one memory read. 

Qi An interlocked load (LDII or LDFI) instruction is performed, and XF1 = 1 . 

The first case is shown in Example 9-8. Since this sequence requires three 
data memory accesses and only two are available, only the execute phase of 
the pipeline is allowed to proceed. The dual reads required by the LDF || LDF 
is delayed one cycle. Note that a refetch of the next instruction can occur. 



Example 9-8. Single Store Followed by Two Reads 



STF R0,*AR1 
LDF *AR2,R1 
LDF *AR3,R2 



; RO — > *AR1 

; *AR2 — > Rl in parallel with 

; *AR3 -> R2 



PC 


| F 


I D 


n 


STF 


- 


n+1 


LDF| I LDF 


STF 


n+2 


W 


LDFl I LDF 


n+3 


X 


W 


n+4 


X 


W 


n+4 


Y 


X 



PIPELINE OPERATION 

R I E 



STF 



Write must complete 



K before the two 

ldfIIldf stf ro,*ari reads can complete. 

LDFll LDF (nop) 

W LDFIILDF *AR2,R1 and *AR3,R2 
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Example 9-9 shows a parallel store followed by a single load or read. Since 
the two parallel stores are required, the next CPU data memory read must wait 
a cycle before beginning. One program memory refetch may occur. 



Example 9-9. Parallel Store Followed by Single Read 



STF R0,*AR0' 
STF R2,*AR1 
ADDF @SUM,R1 
IACK 
ASH 



; RO — > *AR0 in parallel with 

; R2 -> *AR1 

; Rl + @SUM — > Rl 



PC 


I F 


n 


stfIIstf 


n+1 


ADDF 


n+2 


IACK 


n+3 


ASH 


n+4 


ASH 


n+4 


_ 



PIPELINE OPERATION 
D | R | 

STFll STF - 



ADDF 



IACK 



IACK 



ASH 



STFll STF 



Read must 



wait until the 
writes are complete 



ADDF >/STF|| STF R0,*AROand R2,*AR1 
ADDF (nop) 



IACK 



ADDF 
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The final case involves an interlocked load (LDII or LDFI) instruction and XF1 
= 1 . Since the interlocked loads use the XF1 pin as an acknowledge that the 
read can complete, they may need to extend the read cycle, as shown in 
Example 9-1 0. Note that a program refetch may occur. 



Example 9-10. Interlocked Load 



NOT R1,R0 

LDII 300h,AR2 

ADD I *AR2,R2 

CMP I R0,R2 



PC 

n 

n+1 

n+2 

n+3 

n+3 

n+4 



NOT 



LDII 



ADD I 



CMP I 



PIPELINE OPERATION 
D | R | E 

NOT - - 



LDII 



ADD I 



CMP I 



CMP I 



NOT 



LDII 



ADD I 



ADD I 




LDII 



LDII 
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Hold Everything 

There are three types of Hold Everything memory pipeline conflicts: 

Qi A CPU data load or store cannot be performed because an external port 
is busy. 

Qi An external load takes more than one cycle. 

Qi Conditional calls and traps. 

The first type of Hold Everything conflict occurs when one of the external ports 
is busy due to an access that has started but is not complete. In Example 9-1 1 , 
the first store is a two-cycle store. The CPU writes the data to an external port. 
The port control then takes two cycles to complete the data-data write. The 
LDF is a read over the same external port. Since the store is not complete, the 
CPU continues to attempt LDF until the port is available. 



Example 9-11. Busy External Port 



STF R0,@DMA1 
LDF @DMA2,R0 



PIPELINE OPERATION 

PC | F | D | R | E 

n stf - - - 

n+1 LDF STF - - 

n+2 W LDF STF 

n+2 W LDF (nop) STF 



T 



2-cycle external bus 

n+2 w ldf (nop) (nop) ^ write access 

n+3 X W LDF (nop) 

n+4 Y X W LDF 

The second type of Hold Everything conflict involves multicycle data reads. 
The read has begun and continues until completed. In Example 9-1 2, the LDF 
is performed from an external memory that requires several cycles to com- 
plete. 
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Example 9- 12. Multicycle Data Reads 

LDF @DMA,RO 





PIPELINE OPERATION 


PC 


I F I 


D 


| R | E 


n 


LDF 


- 


- 


n+1 


I 


LDF 


- 


n+2 


J 


I- 


LDF 


n+3 


K , (dummy) 


I 


LDF - 


n+3 


K 2 


J 


I LDI 



T 

2-cycle external bus 
^ read access 



The final type of Hold Everything conflict deals with conditional calls and traps, 
which are different from the other branch instructions. Whereas the other 
branch instructions are conditional loads, the conditional calls and traps are 
conditional stores, which take one cycle more than a conditional branch (see 
Example 9-13). The added cycle is used to push the return address after the 
call condition is evaluated. 

Example 9-13. Conditional Calls and Traps 

PIPELINE OPERATION 

PC | F | D | R | E | 

n CALLcond - - - 

n+1 I CALLcond - - 

n+1 (nop) (nop) CALLcond — 

n+1 (nop) (nop) (nop) CALLcond 

PC store 

n+1 (nop) (nop) (nop) CALLcond ~T~ Cycle 

I 

n+2/CALLaddr i (nop) (nop) (nop) 
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9.3 Resolving Register Conflicts 



If the auxiliary registers (AR7-AR0), the index registers (IR1-IR0), data page 
pointer (DP), or stack pointer (SP) is accessed for any reason other than ad- 
dress generation, pipeline conflicts associated with the next memory access 
may occur. The pipeline conflicts and delays were presented in subsection 
9.2.2. 

The following examples, Example 9-1 4 through Example 9-16, demonstrate 
some common uses of these registers that do not produce a conflict or ways 
that the conflict can be avoided. 

Example 9-14. Address Generation Update of an AR Followed by an AR for Address Generation 



PC 


I F 


n 


LDF 


n+1 


MPYF 


n+2 


ADDF 


n+3 


FIX 


n+4 


MPYF 


n+5 


ADDF 



LDF 


7.0,R0 


; 7.0 


— > 


RO 


MPYF 


*++AR0 (IR1) , 


. RO 






ADDF 


*AR2 , RO 








FIX 










MPYF 










ADDF 










PIPELINE OPERATION 








D 


I n I 


E 


I 




LDF 


^^ 


- 







MPYF 



ADDF 



FIX 



MPYF 



LDF, 
MPYF 
ADDF 

FIX 



Address generation and AR update 
Address generation 



LDF 



MPYF 



ADDF 
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Example 9-15. Write toanAR Followed byanAR for Address Generation Without a Pipeline Conflict 



PC 

n 

n+1 

n+2 

n+3 

n+4 

n+5 



F 

LDI 
MPYF 
ADDF 
MPYF 
SUBF 

STF 



LDI STABLE, AR2 

MPYF @ VALUE , Rl 

ADDF R2 , Rl 

MPYF *AR2++,R1 
SUBF 
STF 

PIPELINE OPERATION 




MPYF 



SUBF 



ADDF 



MPYF 



No AR address 
generation done 
for these two 
instructions 

AR2 used for 
address generation 



AR2 loaded 



ADDF 



Example 9-16. Write to DP Followed by a Direct Memory Road Without a Pipeline Conflict 



PC 

n 

n+1 

n+2 

n+3 

n+4 

n+5 



LDP 



POP 



LDF 



LDI 



PUSHF 



PUSH 



LDP TABLE_ADDR 

POP RO 

LDF *-AR3(2),Rl 

LDI @ T ABLE_ADDR , ARO 

PUSHF R6 

PUSH R4 

PIPELINE OPERATION 

D I R I E 



LDP 



POP 



LDF 



LDI 



PUSHF- 



LDP 



POP 



LDF 



LDI 



LDP 



POP 



LDF 



DP loaded 
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9.4 Resolving Memory Conflicts 

If program fetches and data accesses are performed in such a manner that the 
resources being used cannot provide the necessary bandwidth, the program 
fetch is delayed until the data access is complete. Certain configurations of 
program fetch and data accesses yield conditions under which the 
TMS320C3x can achieve maximum throughput. 

Table 9-1 shows how many accesses can be performed from the different 
memory spaces when it is necessary to do a program fetch and a single data 
access, and still achieve maximum performance (one cycle). Four cases 
achieve one-cycle maximization. 

Table 9-1. One Program Fetch and One Data Access for Maximum Performance 



Case# 


Primary Bus 
Accesses 


Accesses From 

Dual-Access 
Internal Memory 


Expansion Bust 

Or Peripheral 

Accesses 


1 


1 


1 


- 


2 


1 


- 


1 


3 


- 


2 from any 

combination 

of internal memory 


- 


4 


- 


1 


1 



t Expansion bus only available on TMS320C30. 
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Table 9-2 shows how many accesses can be performed from the different 
memory spaces when it is necessary to do a program fetch and two data ac- 
cesses, still achieving maximum performance (one cycle). Six cases achieve 
this maximization. 

Table 9-2. One Program Fetch and Two Data Accesses for Maximum Performance 



Case# 


Primary Bus 
Accesses 


Accesses From 

Dual-Access 
Internal Memory 


Expansiont Or 

Peripheral Bus 

Accesses 










1 


1 


2 from any 

combination 

of internal memory 


- 


2t 


1 Program 


1 Data 


1 Data 


3t 


1 Data 


1 Data 


1 Program 


4 


- 


2 from same internal 

memory block and 

1 from a different 

internal memory 

block 


- 


5 


- 


3 from different 

internal memory 

blocks 


- 


6 


- 


2 from any 

combination 

of internal memory 


1 



t Expansion bus available only on TMS320C30. 
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9.5 Clocking of Memory Accesses 



Internal clock phases (H1 and H3) and their relationship to memory accesses 
are discussed in this section to show how the TMS320C3x handles multiple 
memory accesses. Whereas the previous section discussed the interaction 
between sequences of instructions, this section discusses the flow of data on 
an individual instruction basis. 

Each major clock period of 60 ns is composed of two minor clock periods of 
30 ns, labeled H3 and H1 . The active clock period for H3 and H1 is the time 
when that signal is high. 

Major Clock Period 



H1 



H3 



The precise operation of memory reads and writes can be defined according 
to these minor clock periods. The types of memory operations that can occur 
are program fetches, data loads and stores, and DMA accesses. 



9.5.1 Program Fetches 



Internal program fetches are always performed during H3 unless a single data 
store must occur at the same time due to another instruction in the pipeline. 
In this case, the program fetch occurs during H1 and the data store during H3. 

External program fetches always start at the beginning of H3 with the address 
being presented on the external bus. At the end of H 1 , they are completed with 
the latching of the instruction word. 
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9.5.2 Data Loads and Stores 



Four types of instructions perform loads, memory reads, and stores: two-oper- 
and instructions, three-operand instructions, multiplier/ALU operation with 
store instructions, and parallel multiply and add instructions. See Chapter 5 for 
detailed information on addressing modes. 

As discussed in Chapter 7, the number of bus cycles for external memory 
accesses differs in some cases from the number of CPU execution cycles. For 
external reads, the number of bus cycles and CPU execution cycles is identi- 
cal. For external writes, there are always at least two bus cycles, but unless 
there is a port access conflict, there is only one CPU execution cycle. In the 
following examples, any difference in the number of bus cycles and CPU cycles 
is noted. 

Two-Operand Instruction Memory Accesses 

Two-operand instructions include all those instructions with bits 31 — 29 being 
000 or 01 (see Figure 9-2). In the case of a data read, bits 15 — represent 
the srcoperand. Internal data reads are always performed during H1 . External 
data reads always start at the beginning of H3 with the address being pres- 
ented on the external bus, and they complete with the latching of the data word 
at the end of H1. 

In the case of a data store, bits 15 — represent the dst operand. Internal data 
stores are performed during H3. External data stores always start at the begin- 
ning of H3 with the address and data being presented on the external bus. 



Figure 9-2. Two-Operand Instruction Word 



31 



T — I - 

0X0 



24 23 



t — i — i — i — r 

Operation 



T — I — I — 

dst(src) 



1615 



t — i—i — i — i — i — i — i — i — i — i — i — i — i — r 

src(dst) 



Three-Operand Instruction Memory Reads 

Three-operand instructions include all instructions with bits 31 — 29 being 001 
(see Figure 9-3). The source operands, srd and src2, come from either regis- 
ters or memory. When one or more of the source operands are from memory 
these instructions are always memory reads. 

If only one of the source operands is from memory (either srd or src2) and is 
located in internal memory, the data is read during H1 . If the single memory 
source operand is in external memory, the read starts at the beginning of H3, 
with the address being presented on the external bus, and completes with the 
latching of the data word at the end of H1 . 
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If both source operands are to be fetched from memory, then several cases oc- 
cur. If both operands are located in internal memory, the srd read is performed 
during H3 and src2during H1 , thus completing two memory reads in a single 
cycle. 

If srd is in internal memory and src2 is in external memory, the src2 access 
begins at the start of H3 and latches at the end of H1 . At the same time, the 
srd access to internal memory is performed during H3. Again, two memory 
reads are completed in a single cycle. 

If srd is in external memory and src2\s in internal memory, two cycles are nec- 
essary to complete the two reads. In the first cycle, the internal src2 access is 
performed. The srd is also performed, but not latched until the next H3. 

If srd and src2 are both from external memory, two cycles are required to com- 
plete the two reads. In the first cycle, the srd access is performed and loaded 
on the next H3; in the second cycle, the src2 access is performed and loaded 
on that cycle's H1. 



Figure 9-3. Three-Operand Instruction Word 



31 



T — [— 

1 



t — i — i — r 

Operation 



24 23 

— T 



16 15 



t — i — r 

dst 



t — i — i — i — i — i — r 

srd 



87 



t — i — i — i — r 

src2 



Operations with Parallel Stores 

The next class of instructions includes all instructions that have a store in paral- 
lel with another instruction. Bits 31 and 30 for these instructions are equal to 
1 1. 

For those operations that perform a multiply or ALU operation in parallel with 
a store, the instruction word format is shown in Figure 9-4. If the store opera- 
tion to dst2 is external or internal, it is performed during H3. Two bus cycles 
are required for external stores, but only one CPU cycle is necessary to com- 
plete the write. 

If the memory read operation is external, it starts at the beginning of H3 and 
latches at the end of H1 . If the memory read operation is internal, it is performed 
during H1 . Note that memory reads are performed by the CPU during the read 
(R) phase of the pipeline, and stores are performed during the execute (E) 
phase. 
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Figure 9-4. Multiply or CPU Operation With a Parallel Store 



31 



t — i — i — r 

Operation 



24 23 



1615 



87 



1 1 



i r 

dstl 



srd 



src3 



t — i — i — i — i — i — r 

dst2 



t — i — i — i — i — i — r 

src2 



The instruction word format for those instructions that have parallel stores to 
memory is shown in Figure 9-5. If both destination operands, dstl and dst2, 
are located in internal memory, dstl is stored during H3 and dst2 during H1, 
thus completing two memory stores in a single cycle. 

If dstl is in external memory and dst2\s in internal memory, the dstl store be- 
gins at the start of H3. The dst2 store to internal memory is performed during 
H1 . Two bus cycles are required for the external store, but only one CPU cycle 
is necessary to complete the write. Again, two memory stores are completed 
in a single cycle. 

If dstl is in internal memory and dst2 is in external memory, an additional bus 
cycle is necessary to complete the dst2 store. Only one CPU cycle is necessary 
to complete the write, but the port access requires three bus cycles. In the first 
cycle, the internal dstl store is performed during H3, and ds\2 is written to the 
port during H1 . During the next cycle, the dst2 store is performed on the exter- 
nal bus, beginning in H3, and executes as normal through the following cycle. 

If dstl and dst2 are both written to external memory, a single CPU cycle is still 
all that is necessary to complete the stores. In this case, four bus cycles are 
required. 

1 ) In the first cycle, both dst 1 and dst2are written to the port, and the external 
bus access for dstl begins. 

2) The store for dstl is completed on the second cycle, and the store for dst2 
begins on the third external bus cycle. 

3) Finally, the store for dst2 is completed on the fourth external bus cycle. 
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Figure 9-5. Two Parallel Stores 



Clocking of Memory Accesses 



31 



24 23 



16 15 



87 



t — i — i — i — i — i — r 

dst2 



1 1 



t — i — i — r 

ST||ST 



t — r 

src2 



—1 — I— 




t — r 

srd 



t — i — i — i — i — r 

dstl 



Parallel Multiplies and Adds 

Memory addressing for parallel multiplies and adds is similar to that for three- 
operand instructions. The parallel multiplies and adds include all instructions 
with bits 31—30 equal to 1 (see Figure 9-6). 

For these operations, src3 and src4 are both located in memory. If both oper- 
ands are located in internal memory, src3 is performed during H3, and src4 is 
performed during H1 , thus completing two memory reads in a single cycle. 

If src3 is in internal memory and src4 is in external memory, the src4 access 
begins at the start of H3 and latches at the end of H1 . At the same time, the 
src3 access to internal memory is performed during H3. Again, two memory 
reads are completed in a single cycle. 

If src3 is in external memory and src4 is in internal memory, two cycles are nec- 
essary to complete the two reads. In the first cycle, the internal src4 access is 
performed. During the H3 of the next cycle, the src3 access is performed. 

If src3 and src4 are both from external memory, two cycles are necessary to 
complete the two reads. In the first cycle, the src3 access is performed; in the 
second cycle, the src4 access is performed. 



Figure 9-6. Parallel Multiplies and Adds 
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Chapter 10 

Assembly Language instructions 



The TMS320C3x assembly language instruction set supports numeric-intensive, signal 
processing, and general-purpose applications. The instructions are organized into major 
groups consisting of load-and-store, two- or three-operand arithmetic/logical, parallel, 
program control, and interlocked operations instructions. The addressing modes used 
with the instructions are described in Chapter 5. 

The TMS320C3x instruction set can also use one of 20 condition codes with any of the 
10 conditional instructions, such as LDFcond. This chapter defines the condition codes 
and flags. 

The assembler allows optional syntax forms to simplify the assembly language for spe- 
cial-case instructions. These optional forms are listed and explained. 

Each of the individual instructions is described and listed in alphabetical order. An exam- 
ple instruction (see pages 10-13 through 10-15) demonstrates the special format used 
and explain its content. 

This chapter discusses the following major topics: 

Ql Instruction Set (Section 10.1 on page 10-3) 

e Load-and-store instructions 

E Two-operand arithmetic/logical instructions 

h Three-operand arithmetic/logical instructions 

Program control instructions 

■ Interlocked operations instructions 

H Parallel operations instructions 
Ql Condition Codes and Flags (Section 10.2 on 10-9) 
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Individual Instructions (Section 10.3 on page 10-12) 

■ Symbols and abbreviations used in instructions 

■ Optional assembler syntaxes 

■ Individual instruction descriptions alphabetized and including 

Syntax 

Operation 

Operands 

Encoding 

Description 

Cycles 

Status bits 

Mode bit 

Example(s) 
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10.1 Assembly Language Instructions — Instruction Set 

The TMS320C3x instruction set is exceptionally well-suited to digital signal processing 
and other numeric-intensive applications. All instructions are a single machine word long, 
and most instructions take a single cycle to execute. In addition to multiply and accumulate 
instructions, the TMS320C3x possesses a full complement of general-purpose instruc- 
tions. 

The instruction set contains 113 instructions organized into the following functional 
groups: 

Qi Load-and-store 

Q Two-operand arithmetic/logical 

Qi Three-operand arithmetic/logical 

Qi Program control 

Q Interlocked operations 

Q Parallel operations 

Each of these groups is discussed in the succeeding subsections. 

10.1.1 Load-and-Store Instructions 

The TMS320C3x supports 12 load-and-store instructions (see Table 10-1). These in- 
structions can 

Q Load a word from memory into a register, 

Qi Store a word from a register into memory, or 

Q Manipulate data on the system stack. 

Two of these instructions can load data conditionally. This is useful for locating the maxi- 
mum or minimum value in a data set. See Section 1 0.2 for detailed information on condi- 
tion codes. 

Table 10-1. Load-and-Store Instructions 



Instruction 


Description 


Instruction 


Description 


LDE 


Load floating-point exponent 


POP 


Pop integer from stack 


LDF 


Load floating-point value 


POPF 


Pop floating-point value from stack 


LDFcond 


Load floating-point value 
conditionally 


PUSH 


Push integer on stack 


LDI 


Load integer 


PUSHF 


Push floating-point value on stack 


LD I cond 


Load integer conditionally 


STF 


Store floating-point value 


LDM 


Load floating-point mantissa 


STI 


Store integer 
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10.1.2 Two-Operand Instructions 



The TMS320C3x supports a complete set of 35 two-operand arithmetic and logical in- 
structions. The two operands are the source and destination. The source operand may 
be a memory word, a register, or a part of the instruction word. The destination operand is 
always a register. 

These instructions provide integer, floating-point, or logical operations, and multiprecision 
arithmetic. Table 10-2 lists these instructions. 



Table 10-2. Two-Operand Instructions 



Instruction 


Description 


Instruction 


Description 


ABSF 


Absolute value of a floating- 
point number 


NORM 


Normalize floating-point value 


ABSI 


Absolute value of an integer 


NOT 


Bitwise logical-complement 


ADDCt 


Add integers with carry 


ORt 


Bitwise logical-OR 


ADDFt 


Add floating-point values 


RND 


Round floating-point value 


ADDlt 


Add integers 


ROL 


Rotate left 


ANDt 


Bitwise logical-AND 


ROLC 


Rotate left through carry 


ANDNt 


Bitwise logical-AND with 
complement 


ROR 


Rotate right 


ASHt 


Arithmetic shift 


RORC 


Rotate right through carry 


CMPFt 


Compare floating-point values 


SUBBt 


Subtract integers with borrow 


CMPlt 


Compare integers 


SUBC 


Subtract integers conditionally 


FIX 


Convert floating-point value to 
integer 


SUBF 


Subtract floating-point values 


FLOAT 


Convert integer to floating-point 
value 


SUBI 


Subtract integer 


LSHt 


Logical shift 


SUBRB 


Subtract reverse integer with 
borrow 


MPYFt 


Multiply floating-point values 


SUBRF 


Subtract reverse floating-point 
value 


MPYlt 


Multiply integers 


SUBRI 


Subtract reverse integer 


NEGB 


Negate integer with borrow 


TSTBt 


Test bit fields 


NEGF 


Negate floating-point value 


XORt 


Bitwise exclusive-OR 


NEGI 


Negate integer 







t Two- and three-operand versions 
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10.1.3 Three-Operand Instructions 

Most instructions have only two operands; however, some arithmetic and logical instruc- 
tions have three-operand versions. The 17 three-operand instructions allow the 
TMS320C3x to read two operands from memory or the CPU register file in a single cycle 
and store the results in a register. The following differentiates the two- and three-operand 
instructions: 

□i Two-operand instructions have a single source operand (or shift count) and a destina- 
tion operand. 

Qi Three-operand instructions may have two source operands (or one source operand 
and a count operand) and a destination operand. A source operand may be a memory 
word or a register. The destination of a three-operand instruction is always a register. 

Table 1 0-3 lists the instructions that have three-operand versions. Note that the 3 in the 
mnemonic can be omitted from three-operand instructions (see Section 10.3.2). 

Table 10-3. Three-Operand Instructions 



Instruction 


Description 


Instruction 


Description 


ADDC3 


Add with carry 


MPYF3 


Multiply floating-point values 


ADDF3 


Add floating-point values 


MPYI3 


Multiply integers 


ADDI3 


Add integers 


OR3 


Bitwise logical-OR 


AND3 


Bitwise logical-AND 


SUBB3 


Subtract integers with borrow 


ANDN3 


Bitwise logical-AND with complement 


SUBF3 


Subtract floating-point values 


ASH3 


Arithmetic shift 


SUBI3 


Subtract integers 


CMPF3 


Compare floating-point values 


TSTB3 


Test bit fields 


CMPI3 


Compare integers 


XOR3 


Bitwise exclusive-OR 


LSH3 


Logical shift 







10.1.4 Program Control Instructions 

The program-control instruction group consists of all of those instructions (1 6) that affect 
program flow. The repeat mode allows repetition of a block of code (RPTB) or of a single 
line of code (RPTS). Both standard and delayed (single-cycle) branching are supported. 
Several of the program control instructions are capable of conditional operations (see 
Section 11.2 for detailed information on condition codes). Table 10-4 lists the program 
control instructions. 
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Table 10-4. Program Control Instructions 



Instruction 


Description 


Instruction 


Description 


Bcond 


Branch conditionally (standard) 


IDLE 


Idle until interrupt 


BcondD 


Branch conditionally (delayed) 


NOP 


No operation 


BR 


Branch unconditionally (standard) 


RETIcond 


Return from interrupt conditionally 


BRD 


Branch unconditionally (delayed) 


RETScond 


Return from subroutine 
conditionally 


CALL 


Call subroutine 


RPTB 


Repeat block of instructions 


CMLcond 


Call subroutine conditionally 


RPTS 


Repeat single instruction 


DBcond 


Decrement and branch 
conditionally (standard) 


SWI 


Software interrupt 


DBcondD 


Decrement and branch 
conditionally (delayed) 


TRAPcond 


Trap conditionally 


IACK 


Interrupt acknowledge 





10.1.5 Interlocked Operations Instructions 

The interlocked operations instructions support multiprocessor communication and the 
use of external signals to allowfor powerful synchronization mechanisms. They also guar- 
antee the integrity of the communication and result in a high-speed operation. Refer to 
Chapter 7 for examples of the use of interlocked instructions. 

Table 10-5. Interlocked Operations Instructions 



Instruction 


Description 


Instruction 


Description 


LDFI 


Load floating-point value, interlocked 


STFI 


Store floating-point value, interlocked 


LDII 


Load integer, interlocked 


STII 


Store integer, interlocked 


SIGI 


Signal, interlocked 







10.1.6 Parallel Operations Instructions 

The parallel-operations instructions group makes a high degree of parallelism possible. 
Some of the TMS320C3x instructions can occur in pairs that will be executed in parallel. 
These instructions offer the following features: 

Qi Parallel loading of registers, 

Qi Parallel arithmetic operations, or 

Q Arithmetic/logical instructions used in parallel with a store instruction. 

Each instruction in a pair is entered as a separate source statement, the second instruc- 
tion in the pair must be preceded by two vertical bars (| |). Table 1 0-6 lists the valid instruc- 
tion pairs. 
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Table 10-6. Parallel Instructions 



Mnemonic 


Description 


Parallel Arithmetic with Store Instructions 


ABSF 
IISTF 


Absolute value of a floating-point number and store floating-point value 


ABSI 
IISTI 


Absolute value of an integer and store integer 


ADDF3 
IISTF 


Add floating-point values and store floating-point value 


ADDI3 
HSTI 


Add integers and store integer 


AND3 
HSTI 


Bitwise logical-AND and store integer 


ASH3 
HSTI 


Arithmetic shift and store integer 


FIX 

HSTI 


Convert floating-point to integer and store integer 


FLOAT 
||STF 


Convert integer to floating-point value and store floating-point value 


LDF 
IISTF 


Load floating-point value and store floating-point value 


LDI 
HSTI 


Load integer and store integer 


LSH3 
HSTI 


Logical shift and store integer 


MPYF3 
HSTF 


Multiply floating-point values and store floating-point value 


MPYI3 
HSTI 


Multiply integer and store integer 


NEGF 
||STF 


Negate floating-point value and store floating-point value 


NEGI 
HSTI 


Negate integer and store integer 


NOT 
HSTI 


Complement value and store integer 


OR3 
HSTI 


Bitwise logical-OR value and store integer 


STF 
IISTF 


Store floating-point values 


STI 
IISTI 


Store integers 


SUBF3 
HSTF 


Subtract floating-point value and store floating-point value 
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Table 10-6. Parallel Instructions (Concluded) 



Mnemonic 


Description 


Parallel Arithmetic with Store Instructions (Concluded) 


SUBI3 
||STI 


Subtract integer and store integer 


XOR3 
||STI 


Bitwise exclusive-OR values and store integer 


Parallel Load Instructions 


LDF 
|| LDF 


Load floating-point 


LDI 
||LDi 


Load integer 


Parallel Multiply and Add/Subtract Instructions 


MPYF3 
|| ADDF3 


Multiply and add floating-point 


MPYF3 
|| SUBF3 


Multiply and subtract floating-point 


MPYI3 
|| ADDI3 


Multiply and add integer 


MPYI3 
|| SUBI3 


Multiply and subtract integer 
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10.2 Condition Codes and Flags 

The TMS320C3x provides 20 condition codes (00000 — 1 01 00 excluding 01 011) that can 
be used with any of the conditional instructions, such as RETScondor LDFcond. The con- 
ditions include signed and unsigned comparisons, comparisons to zero, and comparisons 
based on the status of individual condition flags. Note that all conditional instructions can 
accept the suffix U to indicate unconditional operation. 

Seven condition flags provide information about properties of the result of arithmetic and 
logical instructions. The condition flags are stored in the status register (ST) and are af- 
fected by an instruction only when either of the following two cases occurs: 

Qi The destination register is one of the extended-precision registers (R7 — R0). This 
allows for modification of the registers used for addressing but does not affect the con- 
dition flags during computation. 

□ The instruction is one of the compare instructions (CMPF, CMPF3, CMPI, CMPI3, 
TSTB, or TSTB3). This makes it possible to set the condition flags according to the 
contents of any of the CPU registers. 

The condition flags may be modified by most instructions when either of the preceding 
conditions is established and either of the following two cases occurs: 

Qi A result is generated when the specified operation is performed to infinite precision. 
This is appropriate for compare and test instructions that do not store results in a regis- 
ter. It is also appropriate for arithmetic instructions that produce underflow or overflow. 

Qi The output is written to the destination register as shown in Table 1 0-7. This is appro- 
priate for other instructions that modify the condition flags. 

Table 10-7. Output Value Formats 



Type Of Operation 


Output Format 


■ Floating-point 


8-bit exponent, 1 sign bit, 31 -bit fraction 


Integer 


32-bit integer 


Logical 


32-bit unsigned integer 



Figure 1 0-1 shows the condition flags in the low-order bits of the status register. Following 
the figure is a list of status register condition flags and descriptions on how the flags are 
set by most instructions. For specific details of the effect of a particular instruction on the 
condition flags, see the description of that instruction in subsection 10.3.3. 
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Figure 10-1. Status Register 
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LUF Latched Underflow Condition Flag. LUF isset whenever UF (floating-point un- 
derflow flag) is set. LUF may be cleared only by a processor reset or by modifying it 
in the status register (ST). 

LV Latched Overflow Condition Flag. LV is set whenever V (overflow condition flag) 
is set. Otherwise, it is unchanged. LV may be cleared only by a processor reset or by 
modifying it in the status register (ST). 

UF Floating-Point Underflow Condition Flag. A floating-point underflow occurs 
whenever the exponent of the result is less than or equal to -1 28. If a floating-point 
underflow occurs, UF is set, and the output value is set to 0. UF is cleared if a floa- 
ting-point underflow does not occur. 

N Negative Condition Flag. Logical operations assign N the state of the MSB of the 
output value. For integer and floating-point operations, N is set if the result is nega- 
tive, and cleared otherwise. Zero is positive. 

Z Zero Condition Flag. For logical, integer, and floating-point operations, Z is set if 
the output is 0, and cleared otherwise. 

V Overflow Condition Flag. For integer operations, V is set if the result does not fit 
into the format specified for the destination (i.e., - 2 32 < result < 2 32 - 1 ). Other- 
wise, V is cleared. For floating-point operations, V is set if the exponent of the result 
is greater than 127; otherwise, V is cleared. Logical operations always clear V. 

C Carry Flag. When an integer addition is performed, C is set if a carry occurs out of 
the bit corresponding to the MSB of the output. When an integer subtraction is per- 
formed, C is set if a borrow occurs into the bit corresponding to the MSB of the out- 
put. Otherwise, for integer operations, C is cleared. The carry flag is unaffected by 
floating-point and logical operations. For shift instructions, this flag is set to the final 
value shifted out; for a zero shift count, this is set to zero. 
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Table 10-8 lists the condition mnemonic, code, description, and flag for each of the 19 
condition codes. 



Table 10-8. Condition Codes and Flags 



Condition 


Code 


Description 


Flagf 


Unconditional Compares 


U 


00000 


Unconditional 


Don't care 


Unsigned Compares 


LO 


00001 


Lower than 


C 


LS 


00010 


Lower than or same as 


CORZ 


HI 


00011 


Higher than 


~C AND ~Z 


HS 


00100 


Higher than or same as 


~C 


EQ 


00101 


Equal to 


Z 


NE 


00110 


Not Equal to 


~Z 


Signed Compares 


LT 


00111 


Less than 


N 


LE 


01000 


Less than or equal to 


NORZ 


GT 


01001 


Greater than 


~N AND ~Z 


GE 


01010 


Greater than or equal to 


~N 


EQ 


00101 


Equal to 


Z 


NE 


00110 


Not equal to 


~z 


Compare to Zero 


Z 


00101 


Zero 


z 


NZ 


00110 


Not zero 


~z 


P 


01001 


Positive 


~N AND ~Z 


N 


00111 


Negative 


N 


NN 


01010 


Nonnegative 


~N 


Compare to Condition Flags 


NN 


01010 


Nonnegative 


~N 


N 


00111 


Negative 


N 


NZ 


00110 


Nonzero 


~Z 


Z 


00101 


Zero 


z 


NV 


01100 


No overflow 


~v 


V 


01101 


Overflow 


V 


NUF 


01110 


No underflow 


~UF 


UF 


01111 


Underflow 


UF 


NC 


00100 


No carry 


~C 


C 


00001 


Carry 


C 


NLV 


10000 


No latched overflow 


~LV 


LV 


10001 


Latched overflow 


LV 


NLUF 


10010 


No latched floating-point underflow 


-LUF 


LUF 


10011 


Latched floating-point underflow 


LUF 


ZUF 


10100 


Zero or floating-point underflow 


ZORUF 



t The ~ means logical complement ("not true" condition). 
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10.3 Individual Instructions 

This section contains the individual assembly language instructions for the TMS320C3x. 
The instructions are listed in alphabetical order. Information for each instruction includes 
assembler syntax, operation, operands, encoding, description, cycles, status bits, mode 
bit, and examples. 

Definitions of the symbols and abbreviations, as well as optional syntax forms allowed by 
the assembler, precede the individual instruction description section. Also, an example in- 
struction shows the special format used and explains its content. 

A functional grouping of the instructions as well as a complete instruction set summary 
can be found in Section 10.1. Appendix B lists the opcodes for all the instructions. Refer 
to Chapter 6 for information on memory addressing. Code examples using many of the 
instructions are given in Chapter 11, Software Applications. 

10.3.1 Symbols and Abbreviations 

Table 1 0-9 lists the symbols and abbreviations used in the individual instruction descrip- 
tions. 

Table 10-9. Instruction Symbols 



Symbol 


Meaning 


src 

srd 

src2 

src3 

src4 


Source operand 
Source operand 1 
Source operand 2 
Source operand 3 
Source operand 4 


dst 

dstl 

dst2 

disp 

cond 

count 


Destination operand 
Destination operand 1 
Destination operand 2 
Displacement 
Condition 
Shift count 


G 
T 
P 
B 


General addressing modes 
Three-operand addressing modes 
Parallel addressing modes 
Conditional-branch addressing modes 


ARn 

IRn 

Rn 

RC 

RE 

RS 

ST 


Auxiliary register n 
Index register n 
Register address n 
Repeat count register 
Repeat end address register 
Repeat start address register 
Status register 
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Table 10-9. Instruction Symbols (Concluded) 



Symbol 


Meaning 


C 

GIE 

N 

PC 

RM 

SP 


Carry bit 

Global interrupt enable bit 

Trap vector 

Program counter 

Repeat mode flag 

System stack pointer 


|x| 

x — »y 
x(man) 
x(exp) 


Absolute value of x 
Assign the value of x to destination y 
Mantissa field (sign + fraction) of x 
Exponent field of x 


op1 
||o P 2 


Operation 1 performed in parallel with operation 2 


xANDy 

xORy 

xXORy 

~x 


Bitwise logical-AND of x and y 
Bitwise logical-OR of x and y 
Bitwise logical-XOR of x and y 
Bitwise logical-complement of x 


x«y 
x»y 
*++SP 
*SP — 


Shift x to the left y bits 

Shift x to the right y bits 

Increment SP and use incremented SP as address 

Use SP as address and decrement SP 



10.3.2 Optional Assembler Syntaxes 

The assembler allows a relaxed syntax form for some instructions. These optional forms 
simplify the assembly language so that special-case syntax can be ignored. The following 
is a list of these optional syntax forms. 

Qi The destination register can be omitted on unary arithmetic and logical operations 
when the same register is used as a source. For example, 

absi ro,ro can be written as absi ro 

Instructions affected: ABSI, ABSF, FIX, FLOAT, NEGB, NEGF, NEGI, NORM, NOT, 
RND. 

Q All 3-operand instructions can be written without the 3. For example, 

addi3 ro,ri,r2 can be written as addi ro,ri,r2 

Instructions affected: ADDC3, ADDF3, ADDI3, AND3, ANDN3, ASH3, LSH3, 
MPYF3, MPYI3, OR3, SUBB3, SUBF3, SUBI3, XOR3. 

This also applies to all the pertinent parallel instructions. 

Q All 3-operand comparison instructions can be written without the 3. For example, 

cmp 1 3 ro,*aro can be written as cmpi ro,*aro 

Instructions affected: CMPI3, CMPF3, TSTB3. 

Qi Indirect operands with an explicit displacement are allowed. In 3-operand or parallel 
instructions, operands with displacement are automatically converted to no-dis- 
placement mode. For example: 
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ldi *+arO(0),ri is legal 
Also 

ADD 1 3 *+AR0 (0) ,R1,R2 is equivalent tO ADDI3 *AR0,R1,R2 

Qi Indirect operands can be written with no displacement, in which case a displacement 
of one is assumed. For example, 

ldi *aro++(1),ro canbewritten ldi *aro++,ro 

d All conditional instructions accept the suffix U to indicate unconditional operation. 
Also, the U can be omitted from unconditional short branch instructions. For example: 

bu label canbewritten b label 

Q[ Labels can be written with or without a trailing colon. For example: 

labelO: NOP 
labell NOP 
label2 : (label assembles to next source line) 

Qi Empty expressions are not allowed for the displacement in indirect mode: 

ldi *+aro ( ) , ro is not legal 

Qi Long immediate mode operands (destination of BR and CALL) can be written with an 
at sign: 

br label canbewritten br @iabel 

Qi The LDP pseudo-op can be used to load a register (usually DP) with the 8 MSBs of 
a relocatable address as follows: 

LDP addr,REG or LDP @addr,REG 

The at sign is optional. 

If the destination REG is the DP, the DP can be omitted in the operand. LDP generates 
an LDI instruction with an immediate operand, and a special relocation type. 

Q Parallel instructions can be written in either order. For example: 

addi can be written as sti 

| | STI I | ADDI 

Qi The parallel bars indicating part 2 of a parallel instruction can be written anywhere on 
the line from column to the mnemonic. For example: 

addi can be written as addi 

| | STI I i STI 

Q If the second operand of a parallel instruction is the same as the third (destination reg- 
ister) operand, the third operand can be omitted. This allows the writing of 3-operand 
parallel instructions that look like normal 2-operand instructions. For example, 

addi *aro,r2,r2 can be written as add *aro,r2 

M MPYI *AR1,R0,R0 II MPYI *AR1,R0 
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Instructions (applies to all parallel instructions that have a register second operand) 
affected: ADDI, ADDF, AND, MPYI, MPYF, OR, SUBI, SUBF, XOR. 

All commutative operations in parallel instructions can be written in either order. For 
example, the ADDI part of a parallel instruction can be written in either of two ways: 



ADDI *AR0,R1,R2 



or 



ADDI R1,*AR0,R2 



The instructions affected are parallel instructions containing any of the following: 
ADDI, ADDF, MPYI, MPYF, AND, OR, XOR. 

□ Use the syntax in Table 1 0-1 to designate CPU registers in operands. Note the alter- 
nate form using designators RO — R27. 

10.3.3 Individual Instruction Descriptions 

Each assembly language instruction for the TMS320C3x is described in this section in al- 
phabetical order. The description includes the assembler syntax, operation, operands, en- 
coding, description, cycles, status bits, mode bit, and examples. 

Table 10-10. CPU Register Syntax 



Assemblers 
Syntax 


Alternate Register 
Syntax 


Assigned 
Function 


RO 
R1 
R2 
R3 
R4 
R5 
R6 
R7 


RO 
R1 
R2 
R3 
R4 
R5 
R6 
R7 


Extended-precision register 
Extended-precision register 
Extended-precision register 
Extended-precision register 
Extended-precision register 
Extended-precision register 
Extended-precision register 
Extended-precision register 


ARO 
AR1 
AR2 
AR3 
AR4 
AR5 
AR6 
AR7 


R8 

R9 

R10 

R11 

R12 

R13 

R14 

R15 


Auxiliary register 
Auxiliary register 
Auxiliary register 
Auxiliary register 
Auxiliary register 
auxiliary register 
Auxiliary register 
Auxiliary register 


DP 
IRO 
IR1 
BK 
SP 


R16 
R17 
R18 
R19 
R20 


Data-page pointer 
Index register 
Index register 1 
Block-size register 
Active stack pointer 


ST 
IE 
IF 
IOF 


R21 
R22 
R23 
R24 


Status register 
CPU/DMA interrupt enable 
CPU interrupt flags 
I/O flags 


RS 
RE 
RC 


R25 
R26 
R27 


Repeat start address 
Repeat end address 
Repeat counter 
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EXAMPLE Example Instruction 



Syntax 



INST src, dst 



or 



Operation 



INST1 src2, dstl 
II INST2 src3, dst2 

Each instruction begins with an assembler syntax expression. Labels may be 
placed either before the command (instruction mnemonic) on the same line or 
on the preceding line in the first column. The optional comment field that con- 
cludes the syntax is not included in the syntax expression. Space(s) are re- 
quired between each field (label, command, operand, and comment fields). 

The syntax examples illustrate the common one-line syntax and the two-line 
syntax used in parallel addressing. Note that the two vertical bars || that indi- 
cate a parallel addressing pair can be placed anywhere before the mnemonic 
on the second line. The first instruction in the pair can have a label, but the sec- 
ond instruction cannot have a label. 

I src | -> dst 



or 



Operands 



\src2 \^>dst1 
|| src3->dst2 

The instruction operation sequence describes the processing that takes place 
when the instruction is executed. For parallel instructions, the operation se- 
quence is performed in parallel. Conditional effects of status register specified 
modes are listed for conditional instructions such as Bcond. 

src general addressing modes (G): 

register (Rn, < n < 27) 

1 direct 

1 indirect 
1 1 immediate 

dst register (Rn, < n < 27) 

or 

src2 indirect (disp = 0, 1, IRO, IR.1) 

dstl register (Rn1 , < n1 < 7) 

src3 register (Rn2, < n2 < 7) 

dst2 indirect (disp = 0, 1 , IRO, IR1 ) 

Operands are defined according to the addressing mode and/or the type of ad- 
dressing used. Note that indirect addressing uses displacements and the index 
registers. Refer to Chapter 5 for detailed information on addressing. 
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Assembly Language Instructions 



Example Instruction EXAMPLE 



Encoding 



31 



24 23 



16 15 



1 1 




l—l' I I I 

INST 


" I™ 

G 


— I I I I 

dst 


— I — I — I — I — I — I — I — I" "'I "' "I I i i i i 

src 


31 


or 

24 23 16 15 




87 







1 1 


I I 

INSTll 


"1 1— 

1 INST2 


— 1 1 

dstl 


— I — I 




ll 

src3 


l l l l I I I 

dst2 


"""1 "" i i i i i 

src2 



Description 



Cycles 



Status Bits 



Encoding examples are shown using general addressing and parallel addres- 
sing. The instruction pair forthe parallel addressing example consists of INST1 
and INST2. 

Instruction execution and its effect on the rest of the processor or memory con- 
tents are described. Any constraints on the operands imposed by the proces- 
sor orthe assembler are discussed. The description parallels and supplements 
the information given by the operation block. 

1 

The digit specifies the number of cycles required to execute the instruction. 

LUF Latched Floating-Point Underflow Condition Flag. 1 if a floating- 
point underflow occurs, unchanged otherwise. 

LV Latched Overflow Condition Flag. 1 if an integer or floating-point 
overflow occurs, unchanged otherwise. 

UF Floating-Point Underflow Condition Flag. 1 if a floating-point un- 
derflow occurs, otherwise. 

N Negative Condition Flag. 1 if a negative result is generated, other- 

wise. In some instructions, this flag is the MSB of the output. 

Z Zero Condition Flag. 1 if a zero result is generated, otherwise. For 

logical and shift instructions, 1 if a zero output is generated, other- 
wise. 

V Overflow Condition Flag. 1 if an integer or floating-point overflow oc- 

curs, otherwise. 

C Carry Flag. 1 if a carry or borrow occurs, otherwise. For shift instruc- 

tions, this flag is set to the value of the last bit shifted out; for a shift 
count of 0. 

The seven condition flags stored in the status register (ST) are modified by the 
majority of instructions only if the destination register isR7 — RO.Theyprovide 
information about the properties of the result or output of arithmetic or logical 
operations. 
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EXAMPLE Example Instruction 



Mode Bit 



Example 



OVM Overflow Mode Flag. In general, integer operations are affected by the 
OVM bit value (described in Table 3-2 on page 3-7). 

INST @98AEh,R5 

Before Instruction: 



DP = 80h 

R5 = 0766900000h = 2.30562500e+02 

Memory at 8098AEh = 5CDFh = 1 .00001 1 07e + 00 

LUF LV UF N Z V C = 00 

After Instruction: 

DP = 80h 

R5 = 0066900000h = 1 .80126953e + 00 

Memory at 8098AEh = 5CDFh = 1 .00001 1 07e + 00 

LUF LV UF N Z V C = 00 

The sample code presented in the above format shows the effect of the code 
on system pointers (e.g., DP or SP), registers (e.g., R1 or R5), memory at spe- 
cific locations, and the seven status bits. The values given for the registers in- 
clude the leading zeros to showthe exponent in floating-point operations. Deci- 
mal conversions are provided for all register and memory locations. The seven 
status bits are listed in the order in which they appear in the assembler and sim- 
ulator (see Table 10-8 and Section 10.2 on page10-9 for further information 
on these seven status bits). 
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Assembly Language Instructions 



Absolute Value of Floating-Point ABSF 



Syntax 


ABSF src, dst 


Operation 


\src\ -> dst 


Operands 


src general addressing modes (G): 




register (Rn, < n < 7) 




1 direct 




1 indirect 




1 1 immediate 



Encoding 



31 



t — r— 





dst register (Rn, < n < 7) 



24 23 



i I i I I 





t — i — r 

dst 



16 15 



t — I — i — I — I — i — r 



87 



t — r 

src 



t — i — i — i — i — r 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



The absolute value of the src operand is loaded into the dst register. The src 
and dst operands are assumed to be floating-point numbers. 

An overflow occurs if src (man) = 80000000h and src (exp) = 7Fh. The result 
is dst (man) = 7FFFFFFFh and dst (exp) = 7Fh. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

UF 0. 

N 0. 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 

ABSF R4,R7 

Before Instruction: 

R4 = 05C8000F971 h = -9.90337307e + 27 
R7 = 07D251 1 0OAEh = 5.48527255e + 37 

LUF LV UF N Z V C = 00 

After Instruction: 

R4 = 05C8000F971 h = -9.90337307e + 27 
R7 = 05C7FFF068Fh = 9.90337307e + 27 
LUF LV UF N Z V C = 
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ABSFIISTF Parallel ABSF and STF 



Syntax 

Operation 

Operands 



ABSF 
STF 



src2, dstl 
src3, dst2 



\src2 \->dst1 
|| src3-*dst2 

src2 indirect (disp = 0, 1 , IRO, IR1 ) 

dstl register (Rn1 , < n1 < 7) 

src3 register (Rn2, < n2 < 7) 

dst2 indirect (disp = 0,1, IRO,. IR1 ) 



Encoding 



31 



1 1 



24 23 



Description 



i I i i 

10 



t — r 

dstl 



~1 — I — 




16 15 



T r 

src3 



i — i — i — i — i — i — r 

dst2 



87 



t — i — i — i — i — i — r 

src2 



A floating-point absolute value and a floating-point store are performed in par- 
allel. All registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STF) reads from a reg- 
ister and the operation being performed in parallel (ABSF) writes to the same 
register, then STF accepts as input the contents of the register before it is modi- 
fied by the ABSF. 

If src2 and ds?2pointto the same location, src2\s read before the write to dst2. 
If src3 and dst 1 point to the same register, src3 is read before the write to dst 1. 

An overflow occurs if src (man) = 80000000h and src (exp) = 7Fh. The result 
is dst (man) = 7FFFFFFFh and dst (exp) = 7Fh. 



Cycles 


1 


Status Bits 


LUF Unaffected. 




LV 1 if a floating-point overflow occurs, unchanged otherwise 




UF 0. 




N 0. 




Z 1 if a zero result is generated, otherwise. 




V 1 if a floating-point overflow occurs, otherwise. 




C Unaffected. 


Mode Bit 


OVM Operation is not affected by OVM bit value. 


Example 


ABSF *++AR3(IRl) , R4 
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Assembly Language Instructions 



Parallel ABSF and STF ABSFNSTF 



Before Instruction: 

AR3 = 809800h 

IR1 = OAFh 

R4 = 733C00000h = 1 .79750e + 02 

AR7 = 8098C5h 

Data at 8098AFh = 58B4000h = - 6.118750e + 01 

Data at 8098C4h = Oh 

LUFLV UF N Z V C = 00 

After Instruction: 

AR3 = 8098AFh 

IR1 = OAFh 

R4 = 574C00000h = 6.1.1 8750e + 01 

AR7 = 8098C5h 

Data at 8098AFh = 58B4000h = -6.1 1 8750e + 01 

Data at 8098C4h = 733C000h = 1 .79750e + 02 

LUFLV UF N Z V C = 00 
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ABSI Absolute Value of Integer 



Syntax 


ABSI src, dst 


Operation 


\src\ -> dst 


Operands 


src general addressing modes (G): 




register (Rn, < n < 27) 




1 direct 




1 indirect 




1 1 immediate 



Encoding 



dst register (Rn, < n < 27) 



31 




24 23 






16 15 


87 





I 1 




1 1 




1 1 1 

1 


1 
G 


1 1 1 l 

dst 


1 1 I I l I l I I l I l l l l 

src 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 1 



The absolute value of the src operand is loaded into the dst register. The src 
and dst operands are assumed to be signed integers. 

An overflow occurs if src = 80000000h. If ST(OVM) = 1, the result is 
dst = 7FFFFFFFh. If ST(OVM) = 0, the result is dst = 80000000h. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 0. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C Unaffected. 

OVM Operation is affected by OVM bit value. 

ABSI R0,R0 

or 

ABSI RO 

Before Instruction: 

R0 = OFFFFFFCBh = - 53 
After Instruction: 

R0 = 035h = 53 



10-22 



Assembly Language Instructions 



Absolute Value of Integer ABSI 



Example 2 ABS I *AR1 , R3 

Before Instruction: 



AR1 = 20h 

R3 = 0h 

Data at 20h = OFFFFFFCBh = - 53 

After Instruction: 

AR1 = 20h 

R3 = 35h = 53 

Data at 20h = OFFFFFFCBh = - 53 
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ABSIJISTI Parallel ABSI and STI 



Syntax 



Operation 



Operands 



Encoding 



31 



1 1 



ABSI src2, dstl 

|| STI src3,dst2 

\src2 \-^dst1 
|| src3->dst2 

src2 indirect (disp = 0, 1 , IRO, IR1 ) 

dstl register (Rn1, 0<1 <7) 

src3 register (Rn2, < n2 < 7) 

dst2 indirect (disp = 0, 1 , IRO, IR1 ) 



— i — i — i — i— 

10 1 



24 23 



t — r 
dstl 



— i — i — 





t — r 

src3 



16 15 



i — i — i — i — i — i — r 

dst2 



87 



i — i — i — i — i — i — r 

src2 



Description 



Cycles 
Status Bits 



An integer absolute value and an integer store are performed in parallel. All 
registers are read at the beginning and loaded at the end of the execute cycle. 
This means that if one of the parallel operations (STI) reads from a register and 
the operation being performed in parallel (ABSI) writes to the same register, 
then STI accepts as input the contents of the register before it is modified by 
the ABSI. 

If src2 and appoint to the same location, src2 is read before the write to dst2. 

An overflow occurs if src = 80000000h. If ST(OVM) = 1 , the result is dst = 
7FFFFFFFh. If ST(OVM) = 0, the result is dst = 80000000h. 



Mode Bit 



1 

These condition flags are modified only if the destination register is R7 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 0. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C Unaffected. 

OVM Operation is affected by OVM bit value. 



R0. 
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Assembly Language Instructions 



Parallel ABSI and STI ABSINSTI 



Example ABSI *-AR5(l),R5 

I | STI R1,*AR2--(IR1) 

Before Instruction: 



AR5 = 8099E2h 

R5 = 0h 

R1 = 42h = 66 

AR2 = 8098FFh 

IR1 =0Fh 

Data at 8099E1 h = OFFFFFFCBh = - 53 

Data at 8098FFh = 2h = 2 

LUFLV UF N Z V C = 00 

After Instruction: 

AR5 = 8099E2h 

R5 = 35h = 53 

R1 = 42h = 66 
AR2 = 8098F0h 
IR1=0Fh 

Data at 8099E1 h = OFFFFFFCBh = - 53 
Data at 8098FFh = 42h = 66 
LUFLV UF N Z V C = 00 
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ADDC Add Integer With Carry 



Syntax 

Operation 

Operands 



Encoding 



31 



T — T" 





ADDC src, dst 

dst + src + C -» dst 

src general addressing modes (G): 

register (Rn, < n < 27) 

1 direct 

1 indirect 
1 1 immediate 

dst register (Rn, < n < 27) 



2423 

i — i — i — i — r 



o o 



1 



i — i — r 

dst 



16 15 87 

i — i — i — i — i — i — i — i — i — i — i — i — i — i — r 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The sum of the dst and src operands and the C (carry) flag is loaded into the 
dst register. The dst and src operands are assumed to be signed integers. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a carry occurs, otherwise. 

OVM Operation is affected by OVM bit value. 

ADDC R1,R5 

Before Instruction: 

R1 = 00FFFF5C25h = - 41 ,947 
R5 = 00FFFF01 9Eh = - 65, 1 22 

LUFLV UF N Z V C = 00 

After Instruction: 

R1 = 00FFFF5C25h = - 41 ,947 ( 

R5 = 00FFFE5DC4h = - 1 07,068 
LUFLVUFNZVC = 0000000 
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Assembly Language Instructions 



Add Integer With Carry, 3-Operand ADDC3 



Syntax 

Operation 

Operands 



Encoding 



31 



T 1— 

1 



ADDC3 src2, srd, dst 
srd + src2 + C -» dst 

srd three-operand addressing modes (T): 
register (Rn1 , < n1 < 27) 

1 indirect (disp = 0, 1, IRO, IR1) 

1 register (Rn1, < n1 <27) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, < n2 < 27) 

1 register (Rn2, < n2 < 27) 
10 indirect (disp = 0, 1 , IRO, IR1) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, < n < 27) 



24 23 



— 1 — I — I — I T 





i — i — r 

dst 



16 15 



t — i — i — i — i — r 

srd 



87 



t — i — i — i — i — r 

src2 



Description 

Cycles 
Status Bits 



Mode Bit 



The sum of the srd and src2 operands and the C (carry) flag is loaded into the 
dst register. The srd, src2, and dst operands are assumed to be signed inte- 
gers. 

1 

These condition flags are modified only if the destination register is R7 — RO. 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

U 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a carry occurs, otherwise. 

OVM Operation is affected by OVM bit value. 
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ADDC3 Add Integer With Carry, 3-Operand 



Example 1 



Example 2 



ADDC3 *AR5++ (IRO) ,R5,R2 

or 
ADDC3 R5, *AR5 + + (IRO) , R2 

Before Instruction: 

AR5 = 809908h 

IRO = 10h 

R5 = 066h = 102 

R2 = 0h 

Data at 809908h = OFFFFFFCBh = - 53 

LUF LV UF N Z V C = 1 

After Instruction: 

AR5 = 80991 8h 

IR0 = 10h 

R5 = 066h=102 

R2 = 032h = 50 

Data at 809908h = OFFFFFFCBh = - 53 

LUF LV UF N Z V C = 1 

ADDC3 R2, R7, RO 

Before Instruction: 

R2 = 02BCh = 700 

R7 = 0F82h = 3970 

R0 = 0h 

LUF LV UF N Z V C = 1 

After Instruction: 

R2 = 02BCh = 700 

R7 = 0F82h = 3970 

R0 = 0123Fh = 4671 

LUFLV UF N Z V C = 00 
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Assembly Language Instructions 



Add Floating-Point ADDF 



Syntax 

Operation 

Operands 



Encoding 



31 



T — I— 




ADDF src, dst 

dst + src -» dst 

src general addressing modes (G): 

register (Rn, < n < 7) 

1 direct 

1 indirect 
1 1 immediate 

dst register (Rn, < n < 7) 



24 23 



16 15 



I I I I I 

1 1 



dst 



t — i — r 



87 



1 — I — I — I — I — r 
src 



t — i — i — i — i — r 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The sum of the dstand srcoperands is loaded into the dst register. The dst and 
src operands are assumed to be floating-point numbers. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF 

LV 

UF 

N 

Z 

V 

c 



1 if a floating-point underflow occurs, unchanged otherwise. 

1 if a floating-point overflow occurs, unchanged otherwise. 

1 if a floating-point underflow occurs, otherwise. 

1 if a negative result is generated, otherwise. 

1 if a zero result is generated, otherwise. 

1 if an floating-point overflow occurs, otherwise. 

Unaffected. 



OVM Operation is not affected by OVM bit value. 

ADDF *AR4 + +(IR1) , R5 

Before Instruction: 

AR4 = 809800h 

IR1 =12Bh 

R5 = 0579800000h = 6.23750e+01 

Data at 809800h = 86B2800h = 4.7031 250e + 02 

LUF LV UF N Z V C = 

After Instruction: 

AR4 = 80992Bh 

IR1 =12Bh 

R5 = 09052C0000h = 5.3268750e+02 

Data at 809800h = 86B2800h = 4.7031 250e + 02 

LUFLV UF N Z V C = 00 



10-29 



ADDF3 Ada [Floating-Point, 3-Operand 



WWWWW*SAWW!WStoWSr"rtW*!»W*WftW# 



Syntax 

Operation 

Operands 



Encoding 



31 



T — I— 

1 



ADDF3 src2, srd, dst 
srd + src2 -> dst 

srd three-operand addressing modes (T): 
register (Rn1 , < n1 < 7) 

1 indirect (disp = 0, 1 , IRO, IR1 ) 

1 register (Rn1, < n1 <7) 

1 1 indirect (disp = 0, 1 , IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, < n2 < 7) 

1 register (Rn2, < n2 < 7) 

1 indirect (disp = 0, 1, IRO, IR1) 
1 1 indirect (disp = 0, 1 , IRO, IR1 ) 

dst register (Rn, < n < 7) 



24 23 



~i — i — i — i — r 

1 



t — i — i — r 

dst 



16 15 



t — i — i — i — i — r 

srd 



87 



t — i — i — i — r 

src2 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 1 



The sum of the srd and src2operands is loaded into the dst register. The srd, 
src2, and dst operands are assumed to be floating-point numbers. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF 

LV 

UF 

N 

Z 

V 

c 



1 if a floating-point underflow occurs, unchanged otherwise. 

1 if a floating-point overflow occurs, unchanged otherwise. 

1 if a floating-point underflow occurs, otherwise. 

1 if a negative result is generated, otherwise. 

1 if a zero result is generated, otherwise. 

1 if an floating-point overflow occurs, otherwise. 

Unaffected. 



OVM Operation is not affected by OVM bit value. 

ADDF3 R6,R5,R1 

or 

ADDF3 R5,R6,R1 

Before Instruction: 

R6 = 086B280000h = 4.7031 250e + 02 

R5 = 0579800000h = 6.23750e+01 

R1 = Oh 

LUFLVUFNZVC = 00 00 
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Assembly Language Instructions 



Add Floating-Point, 3-Operand ADDF3 



After Instruction: 

R6 = 086B280000h = 4.7031 250e + 02 
R5 = 0579800000h = 6.23750e + 01 
R1 = 09052C0000h = 5.3268750e + 02 

LUFLV UF N Z V C = 

Example 2 ADDF3 *+ARl (1) , *AR7++ (IRO) , R4 

Before Instruction: 

AR1 = 809820h 

AR7 = 8099F0h 

IRO = 8h 

R4 = 0h 

Data at 809821 h = 700F000h = 1 .28940e + 02 

Data at 8099F0h = 34C2000h = 1 .27590e + 01 

LUFLV UF N Z V C = 00 

After Instruction: 

AR1 = 809820h 

AR7 = 8099F8h 

IR0 = 8h 

R4 = 070DB20000h = 1 .41 69531 3e + 02 

Data at 809821 h = 700F000h = 1 .28940e + 02 

Data at 8099F0h = 34C2000h = 1 .27590e + 01 

LUFLV UF N Z V C = 
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ApDF3| |STF Parallel ADDF3andSTF 



Syntax 


ADDF3 src2, srd, dstl 




|| STF src3,dst2 


Operation 


srd 


+ src2 -> dst 1 




|| src3 


-» dsX2 


Operands 


srd 


register (Rn1, < n1 <7) 




src2 


indirect (disp = 0, 1 , IRO, IR1 ) 




dstl 


register (Rn2, < n2 < 7) 




src3 


register (Rn3, < n3 < 7) 




dst2 


indirect (disp = 0, 1, IRO, IR1) 



Encoding 



31 



24 23 



1 1 



I I I I 

110 



t — r 

dstl 



srd 



t — r 

src3 



16 15 



t — i — r 



"l — i — i — r 

dst2 



87 



t — i — i — i — i — r 

src2 



Description 



Cycles 
Status Bits 



A floating-point addition and a floating-point store are performed in parallel. All 
registers are read at the beginning and loaded at the end of the execute cycle. 
This means that if one of the parallel operations (STF) reads from a register 
and the operation being performed in parallel (ADDF3) writes to the same reg- 
ister, then STF accepts as input the contents of the register before it is modified 
by the ADDF3. 

If src2 and dst2 point to the same location, src2 is read before the write to dst2. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF 

LV 

UF 

N 

Z 

V 

c 



1 if a floating-point underflow occurs, unchanged otherwise. 

1 if a floating-point overflow occurs, unchanged otherwise. 

1 if a floating-point underflow occurs, otherwise. 

1 if a negative result is generated, otherwise. 

1 if a zero result is generated, otherwise. 

1 if an floating-point overflow occurs, otherwise. 

Unaffected. 



Mode Bit 
Example 



OVM Operation is not affected by OVM bit value. 

ADDF3 *+AR3(IRl) , R2,R5 
I | STF R4, *AR2 
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Assembly Language Instructions 



Parallel ADDF3 andSTF ADDF3J|STF 



Before Instruction: 

AR3 = 809800h 

IR1 = 0A5h 

R2 = 070C800000h = 1 .4050e + 02 

R5 = Oh 

R4 = 057B400000h = 6.281 250e + 01 

AR2 = 8098F3h 

Data at 8098A5h = 733C000h = 1 .79750e + 02 

Data at 8098F3h = Oh 

LUF LV UF N Z V C = 

After Instruction: 

AR3 = 809800h 

IR1 = 0A5h 

R2 = 070C800000h = 1 .4050e+02 

R5 = 0820200000h = 3.20250e + 02 

R4 = 057B400000h = 6.281 250e + 01 

AR2 = 8098F3h 

Data at 8098A5h = 733C000h = 1 .79750e + 02 

Data at 8098F3h = 57B4000h = 6.281 25e + 01 

LUFLV UF N Z V C = 
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ADDI Add Integer 



Syntax 

Operation 

Operands 



Encoding 



31 



ADDI src, dst 

dst + src -» dst 

src general addressing modes (G): 

register (Rn, < n < 27) 

1 direct 

1 indirect 
1 1 immediate 

dst register (Rn, < n < 27) 



t — r- 





24 23 



- 1 — i — i — i — r 

10 



1615 



dst 



i — i — i — r 



87 



t — i — i — i — i — i — i — i — r 

src 



Description 



Status Bits 



Mode Bit 
Example 



The sum of the dst and srcoperands is loaded into the the dst register. The dst 
and srcoperands are assumed to be signed integers. 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a carry occurs, otherwise. 

OVM Operation is affected by OVM bit value. 

ADDI R3,R7 

Before Instruction: 







R3 = 


OFFFFFFCBh 


= . 


-53 




R7 = 


35h = 53 








LUF 


LV UF N Z 


V 


C = 





After Instruction: 









R3 = OFFFFFFCBh = - 53 

R7 = 0h 

LUF LV UF N Z V C = 
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Assembly Language Instructions 



Add Integer, 3-Operand ADDI3 



Syntax 

Operation 

Operands 



Encoding 



31 



T — I - 

1 



AD DI3 <src2 >, <src 1 >, <dst > 

src 1 + src2 -> dst 

srd three-operand addressing modes (T): 
register (Rn1,0<n1 <27) 

1 indirect (disp = 0, 1, IRO, IR1) 

1 register (Rn1 , < n1 < 27) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, < n2 < 27) 

1 register (Rn2, < n2 < 27) 
10 indirect (disp = 0, 1 , IRO, 1R1) 

1 1 indirect (disp = 0, 1 , IRO, IR1 ) 

dst register (Rn, < n < 27) 



24 23 



— i — i — i — r 

10 



t — i — r 

dst 



16 15 



t — i — i — i — i — r 

srd 



t — i — i — i — i — i — r 

src2 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 1 



The sum of the srd and src2operands is loaded into the dst register.The srd, 
src2, and dst operands are assumed to be signed integers. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a carry occurs, otherwise. 

OVM Operation is affected by OVM bit value. 

ADDI3 R4,R7,R5 

Before Instruction: 

R4 = ODCh = 220 
R7 = 0A0h = 160 
R5 = 10h = 16 

LUFLV UF N Z V C = 00 
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ADDI3 Add Integer, 3-Operand 



After Instruction: 

R4 = ODCh = 220 
R7 = 0A0h = 160 
R5 = 017Ch = 380 

LUFLV UF N Z V C = 00 

Example2 ADDI3 *-AR3 (1) , *AR6--(IR0) , R2 

Before Instruction: 

AR3 = 809802h 

AR6 = 809930h 

IR0 = 18h 

R2 = 10h = 16 

Data at 809801 h = 2AF8h = 1 1 ,000 

Data at 809930h = 3A98h = 1 5,000 

LUFLV UF N Z V C = 00 

After Instruction: 

AR3 = 809802h 

AR6 = 80991 8h 

IR0 = 18h 

R2 = 06598h = 26,000 

Data at 809801 h = 2AF8h = 11 ,000 

Data at 809930h = 3A98h = 1 5,000 

LUFLV UF'N Z V C = 



1 o-36 Assembly Language Instructions 



Parallel ADDI3 and STI ADDI3I ISTI 



Syntax 



Operation 



Operands 



Encoding 



31 



ADDI3 src2, srd, dstl 
|| STI src3,dst2 

srd + src2 -> dstl 
|| src3^dst2 

srd register (Rn1 , < n1 < 7) 

src2 indirect (disp = 0, 1, IRO, IR1) 
dstl register (Rn2, < n2 < 7) 
src3 register (Rn3, < n3 < 7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 



o 1 



— l — i — i — r— 

111 



24 23 



1615 



87 



t r 

dstl 



srd 



src3 



i i i r 

dst2 



t — i — i — i — i — r 

src2 



Description 



Cycles 
Status Bits 



An integer addition and an integer store are performed in parallel. All registers 
are read at the beginning and loaded at the end of the execute cycle. This 
means that if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (ADDI3) writes to the same register, then 
STI accepts as input the contents of the register before it is modified by the 
ADDI3. 

If src2 and dst2 point to the same location, src2 is read before the write to dst2. 



Mode Bit 



1 

These condition flags are modified only if the destination register is R7 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a carry occurs, otherwise. 

OVM Operation is affected by OVM bit value. 



R0. 
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ADDI3| |STI Parallel ADDI3 and STI 



Example ADDI3 *ARO--(IRO) , R5, RO 

II • STI R3,*AR7 

Before Instruction: 

ARO = 80992Ch 

IR0 = 0Ch 

R5 = ODCh = 220 

R0 = 0h 

R3 = 35h = 53 

AR7 = 80983Bh 

Data at 80992Ch = 12Ch = 300 

Data at 80983Bh = Oh 

LUFLV UF N Z V C = 00 

After Instruction: 

ARO = 809920h 

IRO = OCh 

R5 = ODCh = 220 

RO = 208h = 520 

R3 = 35h = 53 

AR7 = 80983Bh 

Data at 80992Ch = 12Ch = 300 

Data at 80983Bh = 35h = 53 

LUFLV UF N Z V C = 00 
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Bitwise Logical-AND AND 



Syntax 


AND src, dst 


Operands 


dst AND src -> dst 


Operands 


src general addressing modes (G): 




register (Rn, < n < 27) 




1 direct 




1 indirect 




1 1 immediate (not sign-extended) 



Encoding 



31 



dst register (Rn, < n < 27) 



t — r~ 





24 23 

- 1 — I — I — I — J - 

10 1 



t — i — i — r 

dst 



16 15 87 _^_ 

i — i — i — i — i — i — i — i — i — i — i — i — i — i — r 



src 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The bitwise logical-AND between the dst and src operands is loaded into the 
dst register. The dst and src operands are assumed to be unsigned integers. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N MSB of the output. 

Z 1 if a zero result is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 

AND Rl , R2 

Before Instruction: 



R1 =80h 

R2 = OAFFh 

LUF LV UF N Z 


V 


C = 


= 

















1 


After Instruction: 





















R1 =80h 
R2 = 80h 
LUF LV UF N Z V C = 1 
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AND3 Bitwise Logical- AND, 3-Operand 



Syntax 

Operation 

Operands 



Encoding 



31 



T — T" 

1 



AND src2, srd, dst 
src 1 AND src2-^ dst 

srd three-operand addressing modes (T): 
register (Rn1 , < n1 < 27) 

1 indirect (disp = 0, 1, IRO, IR1) 
10 register (Rn1,0<n1 <27) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, < n2 < 27) 

1 register (Rn2, < n2 < 27) 

1 indirect (disp = 0, 1 , IRO, IR1) 
1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, < n < 27) 



24 23 

— r 



— r— i — i — r 

1 1 



-1 — r 

dst 



16 15 



t — r 



t — i — r 

srd 



t — r 



t — i — i — r 

src2 



Description 

Cycles 
Status Bits 



The bitwise logical-AND between the srd and src2operands is loaded into the 
dsf register. The srd, src2, and dstoperands are assumed to be unsigned inte- 
gers. 



Mode Bit 



1 

These condition flags are modified only if the destination register is R7 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N MSB of the output. 

Z 1 if a zero result is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 



R0. 
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Assembly Language Instructions 



Bitwise Logical-AND,3-Operand AND3 



Example 1 



AND3 *ARO--(IRO) ,*+ARl,R4 

Before Instruction: 

ARO = 8098F4h 

IR0 = 50h 

AR1 = 809951 h 

R4 = 0h 

Data at 8098F4h = 30h 

Dataat809952h = 123h 

LUFLV UF N Z V C = 00 

After Instruction: 



ARO = 8098A4h 

IR0 = 50h 

AR1 = 809951 h 

R4 = 020h 

Data at 8098F4h = 30h 

Dataat809952h = 123h 

LUF LV UF N Z V C = 







Example 2 



AND 3 *-AR5,R7,R4 

Before Instruction: 

AR5 = 80985Ch 

R7 = 2h 

R4 = 0h 

Data at 80985Bh = OAFFh 

LUF LV UF N Z V C = 

After Instruction: 

AR5 = 80985Ch 

R7 = 2h 

R4 = 2h 

Data at 80985Bh = OAFFh 

LUF LV UF N Z V C = 
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AND3IJSTI Parallel AND3 and STI 



Syntax 

Operation 

Operands 



Encoding 



Description 



Cycles 
Status Bits 



AND 
STI 



src2, srd, dstl 
src3, dst2 



srd AND src2-+ dstl 
|| src3 -*dst2 

srd register (Rn1 , < n1 < 7) 

src2 indirect (disp = 0, 1 , IRO, IR1) 

dstl register (Rn2, < n2 < 7) 

src3 register (Rn3, < n3 < 7) 

dst2 indirect (disp = 0, 1 , IRO, IR1) 



31 




24 23 




1615 87 


I 
1 1 


i i I i 

10 


I I 
dstl 


i i 

srd 


src3 


dst2 


I I 1 I I I 1 

src2 



Mode Bit 



A bitwise logical-AND and an integer store are performed in parallel. All regis- 
ters are read at the beginning and loaded at the end of the execute cycle. This 
means that if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (AND3) writes to the same register, then 
STI accepts as input the contents of the register before it is modified by the 
AND3. 

If src2 and dst2 point to the same location, src2 is read before the write to dst2. 

1 

These condition flags are modified only if the destination register is R7 — RO. 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N MSB of the output. 

Z 1 if a zero result is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 
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Assembly Language Instructions 



Parallel AND3 and STI AN D3 1 1 STI 



Example AND 3 * +AR1 ( IRO ) , R4 , R7 

| | STI R3,*AR2 

Before Instruction: 

AR1 =8099F1h 

IR0 = 8h 

R4 = 0A323h 

R7 = 0h 

R3 = 35h = 53 

AR2 = 80983Fh 

Data at 8099F9h = 5C53h 

Data at 80983Fh = Oh 

LUFLV UF N Z V C = 00 

After Instruction: 

AR1 =8099F1h 

R0 = 8h 

R4 = 0A323h 

R7 = 03h 

R3 = 35h = 53 

AR2 = 80983Fh 

Data at 8099F9h = 5C53h 

Data at 80983Fh = 35h = 53 

LUF LV UF N Z V C = 
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ANDN Bitwise Logical- AND With Complement 



Syntax 

Operation 

Operands 



Encoding 



31 



I I 





ANDN src, dst 
dst AND ~src^ dst 

src general addressing modes (G): 

register (Rn, < n < 27) 

1 direct 

1 indirect 

1 1 immediate (not sign-extended) 

dst register (Rn, < n < 27) 



24 23 



—i — i — i — i — r 

110 



t — rrr 

dst 



16 15 



t — i — i — i — i — i — i — i — i — i — i — i — i — r 

src 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The bitwise logical-AND between the dst operand and the bitwise logical com- 
plement (~) of the src operand is loaded into the dst register. The dst and src 
operands are assumed to be unsigned integers. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N MSB of the output. 

Z 1 if a zero result is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 

ANDN @980Ch,R2 

Before Instruction: 



DP = 80h 






R2 = 0C2Fh 






Data at 80980Ch = 0A02h 






LUFLV UF N Z V C = 








After Instruction: 







DP = 80h 

R2 = 042Dh 

Data at 80980Ch = 0A02h 

LUFLV UF N Z V C = 
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Assembly Language Instructions 



Bitwise Logical-ANDN, 3-Operand AN D N3 



Syntax 

Operation 

Operands 



Encoding 



31 



T 1— 

1 



ANDN3 src2, srd, dst 
srd AND ~src2 -> dst 

srd three-operand addressing modes (T): 
register (Rn1 , < n1 < 27) 

1 indirect (disp = 0, 1 , IRO, IR1 ) 

1 register (Rn1 , < n1 < 27) 

1 1 indirect (disp = 0, 1 , IRO, IR1 ) 

src2 three-operand addressing modes (T): 
register (Rn2, < n2 < 27) 

1 register (Rn2, < n2 < 27) 

1 indirect (disp = 0, 1 , IRO, IR1 ) 
1 1 indirect (disp = 0, 1, IO0, IR1) 

dst register (Rn, < n < 27) 



24 23 



16 15 



I I I I I 

10 



i I r 

dst 



t i r 

srd 



t — i — i — i — r 

src2 



i — r 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 1 



The bitwise logical-AND between the srd operand and the bitwise logical 
complement (~) of the src2 operand is loaded into the dst register. The srd, 
src2, and dst operands are assumed to be unsigned integers. 

1 

These condition flags are modified only if the destination register is R7 — RO. 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N MSB of the output. 

Z 1 if a zero result is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 

ANDN3 R5,R3,R7 

Before Instruction: 

R5 = 0A02h 
R3 = 0C2Fh 
R7 = 0h 

LUFLV UFN Z V C = 
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ANDN3 Bitwise Logical-ANDN, 3-Operand 






After Instruction: 

R5 = 0A02h 
R3 = 0C2Fh 
R7 = 042Dh 

LUFLV UF N Z V C = 00 



Example? ANDN3 Rl, *AR5++ (IRO) , RO 

Before Instruction: 



R1 = OCFh 

AR5 = 809825h 

IR0 = 5h 

R0 = 0h 

Data at 809825h = OFFFh 

LUFLV UF N Z V C = 00 

After Instruction: 

R1 = OCFh 

AR5 = 80982Ah 

IR0 = 5h 

RO = 0F30h 

Data at 809825h = OFFFh 

LUFLV UF N Z V C = 00 



1 0-46 Assembly Language Instructions 



Arithmetic Shift ASH 



Syntax 
Operation 



Operands 



Encoding 



31 



T — T" 





ASH count, dst 

If (count >0): 

dst « count-* dst 

Else: 

dst» \count\ -» dst 

count general addressing modes (G): 
register (Rn, < n < 27) 

1 direct 

1 indirect 

1 1 immediate 

dst register (Rn, < n < 27) 



24 23 



—i — i — i — i — r 

111 



G 



t — i — r 

dst 



16 15 



87 



t — i — i — i — i — i — i — i — i — i — i — r 

count 



t — i — r 



Description 



Cycles 



The seven least significant bits of the count operand are used to generate the 
twos-complement shift count of up to 32 bits. 

If the count operand is greater than zero, the dst operand is left-shifted by 
the value of the count operand. Low-order bits shifted in are zero-filled, and 
high-order bits are shifted out through the C (carry) bit. 

Arithmetic left-shift: 
C <r- dst <- 

If the count operand is less than zero, the dst operand is right-shifted by the 
absolute value of the count operand. The high-order bits of the dst operand 
are sign-extended as it is right-shifted. Low-order bits are shifted out through 
the C (carry) bit. 

Arithmetic right-shift: 
sign of dst -> dstC-> 

If the count operand is zero, no shift is performed, and the C (carry) bit is set 
to 0. The count and dst operands are assumed to be signed integers. 

1 
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ASH Arithmetic Shift 



Status Bits 



Mode Bit 
Example 1 



Example 2 



These condition flags are modified only if the destination register is R7 — 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N MSB of the output. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C Set to the value of the last bit shifted out. for a shift count of 0. 

OVM Operation is not affected by OVM bit value. 

ASH R1,R3 

Before Instruction: 



R0. 



R1 = 10h = 16 

R3 = OAEOOOh 

LUF LV UF N Z V C = 

After Instruction: 

R1 =10h 

R3 = OEOOOOOOOh 

LUF LV UF N Z V C = 

ASH @98C3h,R5 

Before Instruction: 







10 10 10 



DP = 80h 

R5 = 0AEC00001h 

Data at 8098C3h = 0FFE8 = - 24 

LUF LV UF N Z V C = 

After Instruction: 







DP = 80h 

R5 = OFFFFFFAEh 

Data at 8098C3h = 0FFE8 = - 24 

LUF LV UF N Z V C = 1 1 
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Assembly Language Instructions 



Arithmetic Shift, 3-Operand ASH3 



Syntax 
Operation 



Operands 



ASH3 count, src, dst 



If {count >0): 
src « count ■ 

Else: 



dst 



src» \count | -» c/sf 

coivrtfthree-operand addressing modes (T): 
register (Rn2, < n2 < 27) 

1 register (Rn2, < n2 < 27) 

1 indirect (disp = 0,1, IRO, IR1 ) 
1 1 indirect (disp = 0, 1, IRO, IR1) 



Encoding 



31 



i r^ 

1 



src three-operand addressing modes (T): 
register (Rn1 , < n1 < 27) 

1 indirect (disp = 0, 1 , IRO, 1R1) 
10 register (Rn1, 0<n1 < 27) 

1 1 indirect (disp = 0, 1 , IRO, IR1) 

dst register (Rn, < n < 27) 



24 23 



I I I I I 

10 1 



i — i — r 

dst 



16 15 



i — i — i — i — i — i — r 

src 



t — i — i — i — i — r 

count 



Description 



The seven least significant bits of the count operand are used to generate the 
twos-complement shift count of up to 32 bits. 

If the count operand is greater than zero, the src operand is left-shifted by the 
value of the count operand. Low-order bits shifted in are zero-filled, and high- 
order bits are shifted out through the status register's C (carry) bit. 

Arithmetic left-shift: 
C <- src <r- 

If the count operand is less than zero, the src operand is right-shifted by the 
absolute value of the count operand. The high-order bits of the srcoperand are 
sign-extended as they are right-shifted. Low-order bits are shifted out through 
the C (carry) bit. 

Arithmetic right-shift: 

sign of src^ src^ C 

If the count operand is zero, no shift is performed, and the C (carry) bit is set 
to 0. The count, src, and dst operands are assumed to be signed integers. 
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ASH3 Arithmetic Shift, 3-Operand 



Cycles 
Status Bits 



Mode Bit 
Example 



1 

These condition flags are modified only if the destination register is R7 — 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N MSB of the output. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C Set to the value of the last bit shifted out. for a shift count of 0. 

OVM Operation is not affected by OVM bit value. 
ASH3 *AR3--(1) , R5,R0 
Before Instruction: 



R0. 



AR3 = 809921 h 

R5 = 02B0h 

R0 = 0h 

Dataat809921h = 10h = 16 

LUFLV UFN Z V C = 







After Instruction: 

AR3 = 809920h 

R5 = 000002B0h 

R0 = 02B00000h 

Dataat809921h = 10h = 16 

LUF LV UF N Z V C = 

Example ASH3 R1,R3,R5 

Before Instruction: 

R1 = 0FFFFFFF8h = - 8 

R3 = OFFFFCBOOh 

R5 = 0h 

LUF LV UF N Z V C = 

After Instruction: 











R1 = 0FFFFFFF8h = - 8 

R3 = OFFFFCBOOh 

R5 = OFFFFFFCBh 

LUF LV UF N Z V C = 1 
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Assembly Language Instructions 



Parallel ASH3 and STI ASH3I ISTI 



Syntax 
Operation 



Operands 



ASH3 count, src2, dstl 
|| STI src3, dst2 

If (counf>0): 

src2« counts dstl 

Else: 

src2» \counf[ -> dstl 

|| src3->dst2 

count register (Rn1, < n1 <7) 

src2 indirect (disp = 0, 1 , IRO, IR1 ) 
dst 1 register (Rn2, < n2 < 7) 
src3 register (Rn3, < n3 < 7) 
dst2 indirect (disp = 0, 1 , IRO, IR1 ) 



Encoding 



31 



24 23 



1 1 



1 i 1 r^ 

10 1 



dstl 



1 — I- 
count 



1 — r 

src3 



1615 



1 — 1 — i — r 

dst2 



i — i — r 



src2 



Description 



The seven least significant bits of the count operand register are used to gener- 
ate the twos-complement shift count of up to 32 bits. 

If the count operand is greater than zero, the dst operand is left-shifted by the 
value of the count operand. Low-order bits shifted in are zero-filled, and high- 
order bits are shifted out through the C (carry) bit. 

Arithmetic left-shift: 
C <- src2 <- 

If the count operand is less than zero, the dst operand is right-shifted by the 
absolute value of the count operand. The high-order bits of the dst operand are 
sign-extended as it is right-shifted. Low-order bits are shifted out through the 
C (carry) bit. 

Arithmetic right-shift: 

sign of src2 -> src2 -> C 

If the count operand is zero, no shift is performed, and the C (carry) bit is set 
to 0. The count and dst operands are assumed to be signed integers. 

All registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STI) reads from a regis- 
ter and the operation being performed in parallel (ASH3) writes to the same 
register, then STI accepts as input the contents of the register before it is modi- 
fied by the ASH3. 
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ASH3| |STI Parallel ASH3 and STI 



If src2 and dst2 point to the same location, src2\s read before the write to dst2. 



Cycles 
Status Bits 



Mode Bit 
Example 



1 

These condition flags are modified only if the destination register is R7 — 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N MSB of the output. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C Set to the value of the last bit shifted out. for a shift count of 0. 

OVM Operation is not affected by OVM bit value. 

ASH3 R1,*AR6++(IR1) , R0 
I | STI R5,*AR2 

Before Instruction: 

AR6 = 809900h 

IR1 =8Ch 

R1 =0FFE8h = -24 

R0 = Oh 

R5 = 35h = 53 

AR2 = 8098A2h 

Data at 809900h = OAEOOOOOOh 

Data at 8098A2h = Oh 

LUFLV UF N Z V C = 00 

After Instruction: 

AR6 = 80998Ch 

IR1 =8Ch 

R1 =0FFE8h=-24 

R0 = OFFFFFFAEh 

R5 = 35h = 53 

AR2 = 8098A2h 

Data at 809900h = OAEOOOOOOh 

Data at 8098A2h = 35h = 53 

LUFLV UF N Z V C = 1 



RO. 
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Assembly Language Instructions 



Branch Conditionally (Standard) Bcond 



Syntax 
Operation 



Operands 



Bcond src 

If cond is true: 

If src is in register addressing mode (Rn, < n < 27), 

src -» PC. 
If src is in PC-relative mode (label or address), 

displacement + PC + 1 -> PC. 
Else, continue. 

src conditional-branch addressing modes (B): 

register 

1 PC-relative 



Encoding 



31 



T 1— 

1 1 



— i — r- 

o 1 o 



24 23 



t — r- 





T — I — I— 

cond 



16 15 



i — i — i — i — i — i — i — i — i — i — i — i — r 

register or displacement 



Description 



Bconc/signif ies a standard branch that executes in four cycles. A branch is per- 
formed if the condition is true (since a pipeline flush also occurs on a true condi- 
tion; see Section 9.2 on page 9-4). If the src operand is expressed in register 
addressing mode, the contents of the specified register are loaded into the PC. 
If the src operand is expressed in PC-relative mode, the assembler generates 
a displacement: displacement = label - (PC of branch instruction + 1 ). This dis- 
placement is stored as a 1 6-bit signed integer in the 1 6 least significant bits of 
the branch instruction word. This displacement is added to the PC of the branch 
instruction plus 1 to generate the new PC. 

The TMS320C3x provides 20 condition codes that can be used with this in- 
struction (see Section 1 0.2 on pagel 0-9 for a list of condition mnemonics, en- 
coding, and flags). Condition flags are set on a previous instruction only when 
the destination register is one of the extended-precision registers (R7-R0) or 
when one of the compare instructions (CMPF, CMPF3, CMPI, CMPI3, TSTB, 
or TSTB3) is executed. 



Cycles 


4 




Status Bits 


LUF 


Unaffected 




LV 


Unaffected 




UF 


Unaffected 




N 


Unaffected 




Z 


Unaffected 




V 


Unaffected 




c 


Unaffected 


Mode Bit 


OVM 


DDeration is r 
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Bcond Branch Conditionally (Standard) 

Example BZ RO 

Before Instruction: 



PC = 2B00h 

RO = 0003FF00h 

LUFLV UF N Z V C = 00 

After Instruction: 

PC = 3FF00h 

RO = 0003FF00h 

LUFLV UF N Z V C = 00 



1 0-54 Assembly Language Instructions 



Branch Conditionally (Delayed) BcondD 



Syntax 
Operation 



Operands 



Encoding 



31 



BcondD src 

If cond is true: 

If src is in register addressing mode (Rn, < n < 27), 

src -» PC. 
If src is in PC-relative mode (label or address), 

displacement + PC + 3 -> PC. 
Else, continue. 

src conditional-branch addressing modes (B): 

register 

1 PC-relative 



- 1 — i — i — i — i — 

11 1 



24 23 

—i — r— 





T — I — r 

cond 



16 15 



87 
"T — r 



i — i — i — i — i — i — i — i — i — i — i — r 

register or displacement 



i — r 



Description 



Cycles 
Status Bits 



Bcond D signifies a delayed branch that allows the three instructions after the 
delayed branch to be fetched before the PC is modified. The effect is a single- 
cycle branch, and the three instructions following Bcond D will not affect the 
cond. 

A branch is performed if the condition is true. If the src operand is expressed 
in register addressing mode, the contents of the specified register are loaded 
into the PC. If the srcoperand is expressed in PC-relative mode, the assembler 
generates a displacement: displacement = label - (PC of branch instruction + 
3). This displacement is stored as a 1 6-bit signed integer in the 1 6 least signifi- 
cant bits of the branch instruction. This displacement is added to the PC of the 
branch instruction plus 3 to generate the new PC. The TMS320C3x provides 
20 condition codes that can be used with this instruction (see Section 1 0.2 on 
page 10-9 for a list of condition mnemonics, encoding, and flags). Condition 
flags are set on a previous instruction only when the destination register is one 
of the extended-precision registers (R7-R0) or when one of the compare in- 
structions (CMPF, CMPF3, CMPI, CMP13, TSTB, or TSTB3) is executed. 

1 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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BcondD Branch Conditionally (Delayed) 

Example BNZD 36 (36 = 24h) 

Before Instruction: 



PC = 50h 

LUFLV UF N Z V C = 00 

After Instruction: 

PC = 77h 

LUFLV UF N Z V C = 00 



1 0-56 Assembly Language Instructions 



Branch Unconditionally (Standard) BR 



Syntax 
Operation 
Operands 
Encoding 



BR src 

src -» PC 

src long-immediate addressing mode 



31 






24 23 


16 15 




87 





1 1 

1 1 


I I 




I 







1 1 1 1 J I 1 1 1 1 1 1 I 1 i 1 1 1 1 1 1 1 1 

src 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



BR signifies a standard branch that executes in four cycles, since a pipeline 
flush also occurs upon execution of the branch; see Section 9.2. An uncondi- 
tional branch is performed. The srcoperand is assumed to be a 24-bit unsigned 
integer. Note that bit 24 = for a standard branch. 



LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



OVM Operation is not affected by OVM bit value. 

BR 8 05Ch 

Before Instruction: 

PC = 80h 

LUFLVUFNZVC = 0000 

After Instruction: 

PC = 805Ch 

LUFLV UF N Z V C = 00 
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BRD Branch Unconditionally (Delayed) 



Syntax 
Operation 
Operands 
Encoding 



BRD src 

src -» PC 

src long-immediate addressing mode 



31 






24 23 


16 15 




87 





I I 

1 1 


I I 




1 



1 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

src 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



BRD signifies a delayed branch that allows the three instructions after the 
delayed branch to be fetched before the PC is modified. The effect is a 
single-cycle branch. 

An unconditional branch is performed. The src operand is assumed to be a 
24-bit unsigned integer. Note that bit 24 = 1 for a delayed branch. 

1 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



OVM Operation is not affected by OVM bit value. 

BRD 2Ch 

Before instruction: 

PC = 1Bh 

LUF LV UF N Z V C = 

After Instruction: 

PC = 2Ch 

LUFLV UF N Z V C = 00 
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Assembly Language Instructions 



Call Subroutine CALL 



Syntax 
Operation 

Operands 
Encoding 



CALL src 

Next PC -> *++SP 

src -» PC 

src long-immediate addressing mode 



31 






24 23 


1615 




87 





1 1 

1 1 


I 1 




1 
1 





1 1 1 1 1 1 1 i 1 1 1 I 1 i 1 1 1 1 1 I 1 1 1 

src 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



A call is performed. The next PC value is pushed onto the system stack. The 
src operand is loaded into the PC. The src operand is assumed to be a 24-bit 
unsigned immediate operand. 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



OVM Operation is not affected by OVM bit value. 

CALL 123456h 

Before Instruction: 

PC = 5h 

SP = 809801 h 

LUFLV UF N Z V C = 00 

After Instruction: 

PC = 1 23456h 

SP = 809802h 

Data at 809802h = 6h 

LUFLV UF N Z V C = 00 
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C^^Lcpjnd^ Call Subroutine Conditionally 



Syntax 
Operation 



Operands 



Encoding 



CALLcond src 

If cond is true: 
Next PC -» *++SP 
If src is in register addressing mode (Rn, < n < 27), 

src -> PC. 
If src is in PC-relative mode (label or address), 

displacement + PC + 1 -> PC. 

Else, continue. 

src conditional-branch addressing modes (B): 

register 

1 PC-relative 



31 



T — I — I 1 — I— 

11 1 



24 23 



t — r- 





t — i — r- 
cond 



1615 87 

t — i — i — i — i — i — i — i — i — i — i — i — i — i — r 



register or displacement 



Description 



A call is performed if the condition is true. If the condition is true, the next PC 
value is pushed onto the system stack. If the srcoperand is expressed in regis- 
ter addressing mode, the contents of the specified register are loaded into the 
PC. If the srcoperand is expressed in PC-relative mode, the assembler gener- 
ates a displacement: displacement = label - (PC of call instruction + 1). This 
displacement is stored as a 1 6-bit signed integer in the 1 6 least significant bits 
of the call instruction word. This displacement is added to the PC of the call 
instruction plus 1 to generate the new PC. 

The TMS320C3x provides 20 condition codes that can be used with this in- 
struction (see Section 10.2 for a list of condition mnemonics, encoding, and 
flags). Condition flags are set on a previous instruction only when the destina- 
tion register is one of the extended-precision registers (R7-R0) or when one 
of the compare instructions (CMPF, CMPF3, CMPI, CMPI3, TSTB, or TSTB3) 
is executed. 



Cycles 


5 




Status Bits 


LUF 


Unaffected. 




LV 


Unaffected. 




UF 


Unaffected. 




N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 


Mode Bit 


OVM( 


^Deration is n 



OVM Operation is not affected by OVM bit value. 
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Assembly Language Instructions 



Call Subroutine Conditionally CALLcond 



Example CALLNZ R5 

Before Instruction: 



PC = 123h 

SP = 809835h 

R5 = 789h 

LUFLV UF N Z V C = 00 

After Instruction: 

PC = 789h 

SP = 809836h 

R5 = 789h 

Dataat809836h = 124h 

LUFLV UF N Z V C = 00 
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CMPF Compare Floating-Point 



Syntax 


CMPF src, dst 


Operation 


dst - src 


Operands 


src general addressing modes (G): 

register (Rn, < n < 7) 

1 direct 

1 indirect 
1 1 immediate 



Encoding 



31 



l I 





Description 

Cycles 
Status Bits 



Mode Bit 
Example 



dst register (Rn, < n < 7) 



24 23 



~i — r- 

10 



t — i — r 

dst 



16 15 



t — i — i — r 



87 



src 



The src operand is subtracted from the dst operand. The result is not loaded 
into any register, thus allowing for nondestructive compares. The dst and src 
operands are assumed to be floating-point numbers. 

1 

These condition flags are modified for all destination registers (R27 — R0). 

LUF 

LV 

UF 

N 

Z 

V 

c 



1 if a floating-point underflow occurs, unchanged otherwise. 

1 if a floating-point overflow occurs, unchanged otherwise. 

1 if a floating-point underflow occurs, otherwise. 

1 if a negative result is generated, otherwise. 

1 if a zero result is generated, otherwise. 

1 if a floating-point overflow occurs, otherwise. 

Unaffected. 



OVM Operation is not affected by OVM bit value. 

CMPF *+AR4,R6 

Before Instruction: 

AR4 = 8098F2h 

R6 = 070C800000h = 1 .4050e+02 

Data at 8098F3h = 070C8000h = 1 .4050e + 02 

LUFLVUFNZVC = 00 

After Instruction: 

AR4 = 8098F2h 

R6 = 070C800000h = 1 .4050e + 02 

Data at 8098F3h = 070C8000h = 1 .4050e + 02 

LUF LV UF N Z V C = 1 
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Assembly Language Instructions 



Compare Floating-Point, 3-Operand CMPF3 



Syntax 

Operation 

Operands 



CMPF3 src2, srd 

srd -src2 

srd three-operand addressing modes (T): 
register (Rn1, < n1 <7) 

1 indirect (disp = 0, 1, IRO, IR1) 

1 register (Rn1 , 0<n1 <7) 

1 1 indirect (disp = 0, 1 , IRO, IR1 ) 

src2 three-operand addressing modes (T): 
register (Rn2, < n2 < 7) 

1 register (Rn2, < n2 < 7) 

1 indirect (disp = 0, 1 , IRO, IR1 ) 
1 1 indirect (disp = 0, 1 , IRO, IR1) 



Encoding 



31 


24 23 




16 15 




87 







i i 

1 


I I I I I 

110 


1 
T 


I 1 1 I 




1 1 1 1 1 1 1 

srd 


1 I I I I I I 

src2 



Description 



Cycles 
Status Bits 



Mode Bit 



The src2operand is subtracted from the srd operand. The result is not loaded 
into any register, thus allowing for nondestructive compares. The srd and src2 
operands are assumed to be floating-point numbers. Although this instruction 
has only two operands, it is designated as a three-operand instruction because 
operands are specified in the three-operand format. 

1 

These condition flags are modified for all destination registers (R27 — R0). 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

UF 1 if a floating-point underflow occurs, otherwise. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 
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CMPF3 Compare Floating-Point, 3-Operand 

Example CMPF3 *AR2, *AR3--(1) 

Before Instruction: 



AR2 = 809831 h 

AR3 = 809852h 

Data at 809831 h = 77A7000h = 2.5044e + 02 

Data at 809852h = 57A2000h = 6.2531 25e + 01 

LUFLV UF N Z V C = 00 

After Instruction: 

AR2 = 809831 h 

AR3 = 809851 h 

Data at 809831 h = 77A7000h = 2.5044e + 02 

Data at 809852h = 57A2000h = 6.2531 25e + 01 

LUF LV UF N Z V C = 1 



1 0-64 Assembly Language Instructions 



Compare Integer CMPI 



Syntax 

Operation 

Operands 



Encoding 



31 



1 1— 





CMPI src, dst 
dst - src 

src general addressing modes (G): 

register (Rn, < n < 27) 

1 direct 

1 indirect 
1 1 immediate 

dst register (Rn, < n < 27) 



24 23 



— i — i — i — i — r 

10 1 



t — i — i — r 

dst 



16 15 



87 



t — i — i — i — i — i — i — i — i — i — i — i — i — r 

src 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The src operand is subtracted from the dst operand. The result is not loaded 
into any register, thus allowing for nondestructive compares. The dst and src 
operands are assumed to be signed integers. 

1 

These condition flags are modified for all destination registers (R27 — R0). 

LUF 

LV 

UF 

N 

Z 

V 

c 



Unaffected. 

1 if an integer overflow occurs, unchanged otherwise. 

0. 

1 if a negative result is generated, otherwise. 

1 if a zero result is generated, otherwise. 

1 if an integer overflow occurs, otherwise. 

1 if a borrow occurs, otherwise. 



OVM Operation is not affected by OVM bit value. 

CMPI R3,R7 

Before Instruction: 



R3 = 898h = 2200 
R7 = 3E8h = 1000 

LUFLV UF N Z V C = 00 

After Instruction: 

R3 = 898h = 2200 
R7 = 3E8h = 1000 
LUF LV UF N Z V C = 1 
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CMPI3 Compare Integer, 3-Operand 



Syntax 

Operation 

Operands 



Encoding 



31 



1 



CMPI3 src2, srd 

srd-src2 

srd three-operand addressing modes (T): 
register (Rn1 , < n1 < 27) 

1 indirect (disp = 0, 1, IRO, IR1) 
10 register (Rn1,0<n1 <27) 

1 1 indirect (disp = 0, 1 , IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, < n2 < 27) 

1 register (Rn2, < n2 < 27) 
10 indirect (disp = 0, 1 , IRO, IR1) 

1 1 indirect (disp = 0, 1, IRO, IR1) 



24 23 



— t — i — i — i — r 

111 



—i — i — i — r 





1615 



t — i — r 

srd 



t — r 



87 



t — i — i — i — i — i — r 

src2 



Description 



Cycles 
Status Bits 



The src2operand is subtracted from the srd operand. The result is not loaded 
into any register, thus allowing for nondestructive compares. The srd and src2 
operands are assumed to be signed integers. Although this instruction has only 
two operands, it is designated as a three-operand instruction because oper- 
ands are specified in the three-operand format. 



1 

These condition flags are modified for all destination registers (R27 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 



RO). 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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Assembly Language Instructions 



Compare Integer, 3-Operand CMPI3 



Example CMP 1 3 R7,R4 

Before Instruction: 



R7 = 03E8h = 1000 
R4 = 0898h = 2200 

LUFLV UF N'Z V C = 

After Instruction: 

R7 = 03E8h = 1000 
R4 = 0898h = 2200 
LUFLV UF N Z V C = 00 
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D B CO n d Decremen t and Branch Conditionally (Standard) 



Syntax 
Operation 



Operands 



Encoding 



31 



DBcond ARn, src 

ARn-1 ->ARn 

If cond is true and ARn > : 

If src is in register addressing mode (Rn, < n < 27), 

src -» PC. 
If src is in PC-relative mode (label or address), 

displacement + PC + 1 -» PC. 
Else, continue. 

src conditional-branch addressing modes (B): 

register 

1 PC-relative 

ARn register (0 < n < 7) 



24 23 



i i i i I 

11 11 



ARn 



t — r~i — r 
cond 



16 15 



87 



i — i — i — i — i — i — i — i — i — i — i — i — i — r 

register or displacement 



Description 



DBcond signifies a standard branch that executes in four cycles because the 
pipeline must be flushed if cond is true. The specified auxiliary register is de- 
cremented and a branch is performed if the condition is true and the specified 
auxiliary register is greater than or equal to zero. The condition flags are those 
set by the last previous instruction that affects the status bits. 

The auxiliary register is treated as a 24-bit signed integer. The most significant 
eight bits are unmodified by the decrement operation. The comparison of the 
auxiliary register uses only the 24 least significant bits of the auxiliary register. 
Note that the branch condition does not depend on the auxiliary register decre- 
ment. 

If the src operand is expressed in register addressing mode, the contents of 
the specified register are loaded into the PC. If the src operand is expressed 
in PC-relative addressing mode, the assemblergenerates adisplacement: dis- 
placement = label - (PC of branch instruction + 1). This integer is stored as a 
1 6-bit signed integer in the 1 6 least significant bits of the branch instruction 
word. This displacement is added to the PC of the branch instruction plus 1 to 
generate the new PC. 

The TMS320C3x provides 20 condition codes that can be used with this in- 
struction (see Section 10.2 for a list of condition mnemonics, encoding, and 
flags). Condition flags are set on a previous instruction only when the destina- 
tion register is one of the extended-precision registers (R0-R7) or when one 
of the compare instructions (CMPF, CMPF3, CMPI, CMPI3, TSTB, orTSTB3) 
is executed. 
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Assembly Language Instructions 



Decrement and Branch Conditionally (Standard) D B CO n d 



Cycles 


4 


Status Bits 


LUF Unaffected. 




LV Unaffected. 




UF Unaffected. 




N Unaffected. 




Z Unaffected. 




V Unaffected. 




C Unaffected. 


Mode Bit 


OVM Operation is not affected by OVM bit value. 


Example 


CMPI 200, R3 




DBLT AR3,R2 




Before Instruction: 




PC = 5Fh 




AR3 = 12h 




R2 = 9Fh 




R3 = 80h 




LUF LV UF N Z V C = 1 




After Instruction: 



PC = 9Fh 

AR3 = 11h 

R2 = 9Fh 

R3 = 80h 

LUF LV UF N Z V C = 1 
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DBcondD Decrement and Branch Conditionally (Delayed) 



Syntax 
Operation 



Operands 



Encoding 



31 



T — I— 
1 1 



DBcondD ARn, src 

ARn-1 ->ARn 

If cond is true and ARN > 0: 

If src is in register addressing mode (Rn, < n < 27) 

src -> PC 
If src is in PC-relative mode (label or address) 
displacement + PC + 3 -> PC. 

Else, continue. 

src conditional-branch addressing modes (B): 
0' register 
1 PC-relative 

ARn register (0 < n < 7) 



—i — r- 

1 1 



24 23 



T" 
ARn 



i — i — r 

cond 



16 15 



87 



i — i — i — i — i — i — i — i — i — i — i — i — i — r 

register or displacement 



Description 



DBcond D signifies a delayed branch that allows the three instructions after the 
delayed branch to be fetched before the PC is modified, The effect is a single- 
cycle branch. The specified auxiliary register is decremented and a branch is 
performed if the condition is true and the specified auxiliary register is greater 
than or equal to zero. The condition flags are those set by the last previous in- 
struction that affects the status bits. The three instructions following the 
DBcondD do not affect the cond. 

The auxiliary register is treated as a 24-bit signed integer. The most significant 
eight bits are unmodified by the decrement operation. The comparison of the 
auxiliary register uses only the 24 least significant bits of the auxiliary register. 
Note that the branch condition does not depend on the auxiliary register decre- 
ment. 

If the src operand is expressed in register addressing mode, the contents of 
the specified register are loaded into the PC. If the src is expressed in PC-rela- 
tive addressing, the assembler generates a displacement: displacement = la- 
bel - (PC of branch instruction + 3). This displacement is added to the PC of 
the branch instruction plus 3 to generate the new PC. Note that bit 21 = 1 for 
a delayed branch. 

The TMS320C3x provides 20 condition codes that can be used with this in- 
struction (see Section 10.2 for a list of condition mnemonics, encoding, and 
flags). Condition flags are set on a previous instruction only when the destina- 
tion register is one of the extended-precision registers (R7-R0) or when one 
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Assembly Language Instructions 



Decrement and Branch Conditionally (Delayed) DBcond D 



of the compare instructions (CMPF, CMPF3, CMPI, CMPI3, TSTB, or TSTB3) 
is executed. 



Cycles 


1 




Status Bits 


LUF 


Unaffected. 




LV 


Unaffected. 




UF 


Unaffected. 




N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 


Mode Bit 


OVM Operation is not affe 


Example 


CMPI 


0,R2 




DBZD 


AR5, $+110h 




Before Instruction: 



PC = 100h 

R2 = 26h 

AR5 = 67h 

LUF LV UF N Z V C = 

After Instruction: 







PC = 21 Oh 

R2 = 26h 

AR5 = 66h 

LUF LV UF N Z V C = 
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FIX Floating-Point to Integer Conversion 



Syntax 


FIX src, dst 


Operation 


tix(src) -> dst 


Operands 


src general addressing modes (G): 




register (Rn, < n < 7) 




1 direct 




1 indirect 




1 1 immediate 



Encoding 



31 



t — r— 





dst register (Rn, < n < 27) 



24 23 



- 1 — i — i — i — r 

10 1 



t — r 

dst 



1615 



87 



t — i — i — i — r — i — i — i — i — i — i — i — i — i — r 



Description 



Cycles 
Status Bits 



Mode Bit 



The floating-point operand src is converted to the nearest integer less than or 
equal to it in value, and the result is loaded into the dst register. The srcoperand 
is assumed to be a floating-point number and the dst operand a signed integer. 

The exponent field of the result register (if it has one) is not modified. 

Integer overflow occurs when the floating-point number is too large to be repre- 
sented as a 32-bit twos-complement integer. In the case of integer overflow, 
the result will be saturated in the direction of overflow. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 



10-72 



Assembly Language Instructions 



Floating-Point to Integer Conversion FIX 



Example FIX R1,R2 

Before Instruction: 



R1 = 0A28200000h = 1 .3454e + 3 
R2 = 0h 

LUFLV UF N Z V C = 00 

After Instruction: 

R1 = 0A28200000h = 13454e + 3 

R2 = 541 h = 1345 

LUFLV UF N Z V C = 00 



10-73 



FIXNSTI 



Parallel FIX and STI 



Syntax 

Operation 

Operands 

Encoding 



FIX src2, dstl 
|| STI src3, dst2 

fix(src£) -> dstl 
|| src3-*dst2 

src2 indirect (disp = 0,1, IRO, IR1 ) 

dstl register (Rn1 , < n1 < 7) 

src3 register (Rn2, < n2 < 7) 

dst2 indirect (disp = 0, 1, IRO, IR1) 



31 



1 1 



Description 



Cycles 
Status Bits 



— i — i — i — r - 

10 10 



24 23 



16 15 



87 



i — r 
dstl 



— I — l — 





src3 



t — i — i — i — i — i — r 

dst2 



i — r 



i — i — r 
src2 



i — r 



Mode Bit 



A floating-point to integer conversion is performed. All registers are read at the 
beginning and loaded at the end of the execute cycle. This means that if one 
of the parallel operations (STI) reads from a register, and the operation being 
performed in parallel (FIX) writes to the same register, then STI accepts as in- 
put the contents of the register before it is modified by FIX. 

If src2 and dst2 point to the same location, src2\s read before the write to dst2. 

Integer overflow occurs when the floating-point number is too large to be repre- 
sented as a 32-bit twos-complement integer. In the case of integer overflow, 
the result will be saturated in the direction of overflow. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 



10-74 



Assembly Language Instructions 



Parallel FIXandSTI FIX||STI 



Example fix * + +AR4 ( l ) , ri 

|| STI R0,*AR2 

Before Instruction: 



AR4 = 8098A2h 














R1 = Oh 














RO = ODCh = 220 














AR2 = 80983Ch 














Data at 8098A3h = 


733C000h ■ 


= 1 7950e 


+ 02 




Data at 80983Ch = 


Oh 












LUF LV UF N Z 


V 


C = 














After Instruction: 















AR4 = 8098A3h 

R1 =0B3h = 179 

RO = ODCh = 220 

AR2 = 80983Ch 

Data at 8098A3h = 733C000h = 1 .79750e + 02 

Data at 80983Ch = ODCh = 220 

LUFLV UF N Z V C = 00 



10-75 



FLOAT 



Integer to Floating-Point Conversion 



Syntax 


FLOAT src, dst 


Operation 


float (src) -» dst 


Operands 


src general addressing modes (G): 




register (Rn, < n < 27) 




1 direct 




1 indirect 




1 1 immediate 



Encoding 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



31 



t — r~ 





dst register (Rn, < n < 7) 



24 23 



—i — i — i — i — r 

10 11 



t — i — i — r 

dst 



16 15 



T 1 1 — I 1 T 



87 



t — i — r 

src 



t — i — i — i — i — r 



The integer operand src is converted to the floating-point value equal to it, and 
the result loaded into the dst register. The src operand is assumed to be a 
signed integer, and the dst operand a floating-point number. 



1 

These condition flags are modified only if the destination register is R7 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 

FLOAT *++AR2 (2) ,R5 

Before Instruction: 

AR2 = 809800h 

R5 = 034C2000h = 1 .275781 25e + 01 

Data at 809802h = OAEh = 1 74 

LUFLV UF N Z V C = 00 

After Instruction: 

AR2 = 809802h 

R5 = 072E00000h = 1 .74e + 02 

Data at 809802h = OAEh = 174 

LUFLV UF N Z V C = 00 



R0. 



10-76 



Assembly Language Instructions 



Parallel FLOAT and STF FLOATI ISTF 



Syntax 



Operation 



Operands 



Encoding 



31 



FLOAT src2,dst1 
|| STF src3, dst2 

i\oa\(src2 ) -» dstl 
|| src3^dst2 

src2 indirect (disp = 0, 1, IRO, IR1) 

dstl register (Rn1 , < n1 < 7) 

src3 register (Rn2, < n2 3 7) 

dst2 register (disp = 0, 1 , IRO, IR1 ) 



i — 
1 1 



■—i — i — i — r— 

10 11 



24 23 



t — r 
dstl 



— i — i — 





i — r 
src3 



16 15 



"i — i — i — i — r 

dst2 



t— r 



87 



i i i r 

src2 



i — r 



Description 



An integer to floating-point conversion is performed. All registers are read at 
the beginning and loaded at the end of the execute cycle. This means that if 
one of the parallel operations (STF) reads from a register and the operation 
being performed in parallel (FLOAT) writes to the same register, then STF ac- 
cepts as input the contents of the register before it is modified by FLOAT. 

If src2and dst2po\ntto the same location, src2\s read before the write to dst2. 



Cycles 
Status Bits 



Mode Bit 



1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is affected by OVM bit value. 



10-77 



FLO AT] J STF Parallel FLOAT and STF 



Example FLOAT *+AR2 (IRO) ,R6 

I I STF R7, *AR1 

Before Instruction: 



AR2 = 8098C5h 

IR0 = 8h 

R6 = 0h 

R7 = 034C200000h = 1 .275781 25e + 01 

AR1 = 809933h 

Data at 8098CDh = OAEh = 174 

Data at 809933h = Oh 

LUFLV UF N Z V C = 00 

After Instruction: 

AR2 = 8098C5h 

IR0 = 8h 

R6 = 072E000000h = 1 .740e + 02 

R7 = 034C200000h = 1 .275781 25e + 01 

AR1 = 809933h 

Data at 8098CDh = OAEh = 174 

Data at 809933h = 034C2000h = 1 .275781 25e + 01 

LUFLV UF N Z V C = 00 



1 0-78 Assembly Language Instructions 



Interrupt Acknowledge IACK 



Syntax 
Operation 

Operands 
Encoding 



31 



T — I— 





IACK src 



Perform a dummy read ope ration with IACK = 0. 
At end of dummy read, set IACK to 1 . 

src general addressing modes (G): 
1 direct 
1 indirect 



24 23 



— i — i — i — i — r 

11 11 o 



—i — i — i — r— 





16 15 



i — i — i — i — i — i — r 



87 



t — i — i — r 

src 



t — i — r 



Description 



A du mmy re ad operation is performed with IACK = 0. At the end of the dummy 
read, IACK is set to 1 . This instruction can be used to generate an external in- 
terrupt acknowledge. If the add ress sp ecified is off-chip, a read operation from 
that address is performed. The IACK signal and the address can then be used 
to signal interrupt acknowledge to external devices. The data read by the pro- 
cessor is unused. 



Cycles 


1 


Status Bits 


LUF Unaffected. 




LV Unaffected. 




UF Unaffected. 




N Unaffected. 




Z Unaffected. 




V Unaffected. 




C Unaffected. 


Mode Bit 


OVM Operation is not affected by OVM bit value. 


Example 


IACK *AR5 




Before Instruction: 




IACK = 1 




PC = 300h 




LUFLV UF N Z V C = 00 




After Instruction: 



IACK = 1 
PC = 301h 

LUFLV UF N Z V C = 00 
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IDLE Idle Until Interrupt 



Syntax 


IDLE 


Operation 


1 -> ST(GIE) 




Next PC -> PC 




Idle until interrupt 


Operands 


None 


Encoding 





31 


24 23 


1615 


87 


i i 




I I i 1 1 

110 


I I I I I I I I I I I I 

00 00 000000 00 


I I I I I I i i i i 

00 000000000 



Description 



The global interrupt enable bit is set, the next PC value is loaded into the PC, 
and the CPU idles until an interrupt is received. When the interrupt is received, 
the contents of the PC are pushed onto the active system stack. 



Cycles 
Status Bits 



Mode Bit 



1 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



OVM Operation is not affected by OVM bit value. 



10-80 



Assembly Language Instructions 



Load Floating-Point Exponent LDE 



Syntax 

Operation 

Operation 



Encoding 



31 



T 1 — 





LDE src, dst 
src(exp) -> dst{exp) 

src general addressing modes (G): 

register (Rn, < n < 7) 

1 direct 

1 indirect 
1 1 immediate 

dst register (Rn, < n < 7) 



24 23 



16 15 



-I — I — I — I — 1 — 

110 1 



i i i r 

dst 



t — i — i — i — i — i — r 



87 



t — r 

src 



t — i — i— i — i — r 



Description 



The exponent field of the src operand is loaded into the exponent field of the 
dst register. No modification of the dst register mantissa field is made unless 
the value of the exponent loaded is the reserved value of the exponent for zero 
as determined by the precision of the src operand. Then the mantissa field of 
the cfef register is set to zero. The src and dst operands are assumed to be floa- 
ting-point numbers. Immediate values are evaluated in the short floating point 
format. 



Cycles 


1 




Status Bits 


LUF 


Unaffected. 




LV 


Unaffected. 




UF 


Unaffected. 




N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 


Mode Bit 


OVM Operation is nc 


Example 


LDE 


R0,R5 




Before Instruction: 



R0 = 0200056F30h = 4.00066337e + 00 
R5 = 0A056FE332h = 1 .06749648e + 03 
LUF LV UF N Z V C = 

After Instruction: 



R0 = 0200056F30h = 4.00066337e + 00 
R5 = 02056FE332h = 4.1 699081 4e + 00 
LUF LV UF N Z V C = 
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LDF 



Load Floating-Point 



Syntax 

Operation 

Operands 



Encoding 



31 



I I 





LDF src, dst 
src -> dst 

src general addressing modes (G): 

register (Rn, < n < 7) 

1 direct 

1 indirect 
1 1 immediate 

dst register (Rn, < n < 7) 



24 23 



1615 



I I I I I 

1110 



dst 



i — r 



i — r 



, 87 

i — 1 i — i — i — i — i — r 



t — r 



src 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The src operand is loaded into the dst register. The dst and src operands are 
assumed to be floating-point numbers. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 

LDF @9800h,R2 

Before Instruction: 

DP = 80h 

R2 = Oh 

Data at 809800h = 10C52A00h = 2.19254303e + 00 

LUFLV UF N Z V C = 00 

After Instruction: 

DP = 80h 

R2 = 010C52A000h = 2.19254303e + 00 

Data at 809800h = 10C52A00h = 2.19254303e + 00 

LUFLV UF N Z V C = 00 



10-82 



Assembly Language Instructions 



Load Floating-Point Conditionally LDFcond 



Syntax 


LDFcond src, dst 


Operation 


If cond is true: 
src -» dst. 




Else: 
dst is unchanged. 


Operands 


src general addressing modes (G): 

register (Rn, < n < 7) 

1 direct 

1 indirect 
1 1 immediate 



Encoding 



31 



dst register (Rn, < n < 7) 



t — i — i — 

1 



24 23 



t — i — i — r 

cond 



t — r 
dst 



16 15 



87 



t — i — i — i — i — i — i — i — i — i — i — i — i — i — r 

src 



Description 



Cycles 
Status Bits 



If the condition is true, the src operand is loaded into the dst register. Other- 
wise, the dst register is unchanged. The dst and src operands are assumed 
to be floating-point numbers. 

The TMS320C3x provides 20 condition codes that can be used with this in- 
struction (see Section 10.2 for a list of condition mnemonics, encoding, and 
flags). Note that an LDFU (load floating-point unconditionally) instruction is 
useful for loading R7 — R0 without affecting condition flags. Condition flags 
are set on a previous instruction only when the destination register is one of 
the extended-precision registers (R7-R0) or when one of the compare instruc- 
tions (CMPF, CMPF3, CMPI, CMPI3, TSTB, or TSTB3) is executed. 

1 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



Mode Bit 



OVM Operation is not affected by OVM bit value. 



10-83 



LDFcond Load Floating-Point Conditionally 

Example LDFZ R3,R5 

Before Instruction: 



R3 = 2CFF2CD500h = 1.77055560e +13 
R5 = 5F0000003Eh = 3.961 40824e + 28 
LUFLV UF N Z V C = 1 

After Instruction: 

R3 = 2CFF2CD500h = 1 .77055560e +1 3 
R5 = 2CFF2CD500h = 1 .77055560e +1 3 
LUF LV UF N Z V C = 1 



1 0-84 Assembly Language Instructions 



Load Floating-Point, Interlocked LDFI 



Syntax 


LDFI src, dst 


Operation 


Signal interlocked operation. 

src -» dst 


Operands 


src general addressing modes (G): 
1 direct 
1 indirect 



Encoding 



31 



- 1 — r - 





dst register (Rn, < n < 7) 



24 23 
— T 



-I — i — i — r 

1111 



1615 



t — i — i — r 

dst 



87 

t — i — i — i — i — i — i — i — r 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



The src operand is loaded into the dst register. An interlocked operation is sig- 
naled over XF0 and XF1 . The srcand dst operands are assumed to be floating- 
point numbers. Note that only direct and indirect modes are allowed. Refer to 
Section 6.4 for detailed description. 

1 if XF1 = (see Section 6.4) 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 

LDFI *+AR2,R7 

Before Instruction: 



AR2 = 8098F1h 

R7 = Oh 

Data at 8098F2h = 584C000h = - 6.281 25e + 01 

LUFLV UFNZ V C = 

After Instruction: 

AR2 = 8098F1h 

R7 = 0584C00000h = - 6.281 25e + 01 

Data at 8098F2h = 584C000h =- 6.281 25e + 01 

LUF LV UF N Z V C = 1 



10-85 



LDF||LDF Parallel LDF and LDF 



Syntax 



Operation 



Operands 



LDF src2, dst2 
|| LDF srd, dstl 

src2 -> dst2 
|| srd -> dstl 

srd indirect (disp = 0, 1 , IRO, IR1 ) 

dstl register (Rn1 , < n1 < 7) 

src2 indirect (disp = 0, 1, IRO, IR1) 

dst2 register (Rn2, < n2 < 7) 



Encoding 



31 



1 1 



~ i — i — i — r— 

10 



24 23 

— i — r 

dst2 



i — r 
dstl 



16 15 







i — i — i — i — i — i — r 

srd 



8 7 

t — i — i — i — i — i — r 



src2 



Description 



Two floating-point loads are performed in parallel. If the LDFs load the same 
register, the assembler issues a warning. The result is that of LDF src2, dst2. 



Cycles 
Status Bits 



Mode Bit 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



OVM Operation is not affected by OVM bit value. 



10-86 



Assembly Language Instructions 



Parallel LDF and LDF LDFIILDF 



Example LDF *-- AR1 (IRO) , R7 

| | LDF *AR7++(1) ,R3 

Before Instruction: 



AR1 = 80985Fh 

IR0 = 8h 

R7 = 0h 

AR7 = 80988Ah 

R3 = 0h 

Data at 809857h = 70C8000h = 1 .4050e + 02 

Data at 80988Ah = 57B4000h = 6.281 250e + 01 

LUFLVUFNZVC = 0000000 

After Instruction: 

AR1 = 809857h 

R0 = 8h 

R7 = 070C800000h = 1.4050e + 02 

AR7 = 80988Bh 

R3 = 057B400000h = 6.281 250e + 01 

Data at 809857h = 70C8000h = 1 .4050e + 02 

Data at 80988Ah = 57B4000h = 6.281 250e + 01 

LUF LV UF N Z V C = 



10-87 



LDFIISTF Parallel LDF and STF 



Syntax 



Operation 



Operands 



Encoding 



31 



LDF src2, dstl 
|| STF src3, ds\2 

src2 -» dstl 
|| src3^dst2 

src2 indirect (disp = 0, 1, IRO, IR1) 
dst 1 register (Rn1 , < n1 < 7) 
src3 register (Rn2, < n2 < 7) 
dst2 indirect (disp = 0, 1 , IRO, IR1 ) 



24 23 



1615 



1 1 



-j — i — i — r - 

110 



t — r 

dstl 



— T 1 — 





i — r 

src3 



t — i — i — i — r 

dst2 



t — r 



87 



t — i — i — i — i — i — r 

src2 



Description 

Cycles 
Status Bits 



Mode Bit 



A floating-point load and a floating-point store are performed in parallel. 
If src2 and dst2 point to the same location, src2 is read before the write to dst2. 
1 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



OVM Operation is not affected by OVM bit value. 



10-88 



Assembly Language Instructions 



Parallel LDF and STF LDFjjSTF 



Example LDF *AR2- - ( 1 ) , Rl 

| | STF R3, *AR4++(IR1) 

Before Instruction: 



AR2 = 8098E7h 

R1 = Oh 

R3 = 057B400000h = 6.281 25e + 01 

AR4 = 809900h 

IR1 = 10h 

Data at 8098E7h = 70C8000h = 1 .4050e + 02 

Data at 809900h = Oh 

LUFLV UF N Z V C = 00 

After Instruction: 

AR2 = 8098E6h 

R1 = 070C800000h = 1.4050e + 02 

R3 = 057B400000h = 6.281 25e + 01 

AR4 = 80991 Oh 

IR1 =10h 

Data at 8098E7h = 70C8000h = 1 .4050e + 02 

Data at 809900h = 57B4000h = 6.281 25e + 01 

LUFLV UF N Z V C = 00 



10-89 



LDI Load Integer 




Syntax 


LDI src, dst 


Operation 


src -> dst 


Operands 


src general addressing modes (G): 

register (Rn, < n < 27) 

1 direct 

1 indirect 
1 1 immediate 



Encoding 



31 







dst register (Rn, < n < 27) 



24 23 



~~i i i i r 

1 



t — r 

dst 



16 15 



t — i — i — i — i — i — r 



87 



-\ — r 
src 



t i i i i r 



Description 



Cycles 
Status Bits 



Mode Bit 



The src operand is loaded into the dst register. The dst and src operands are 
assumed to be signed integers. An alternate form of LDI, LDP, is used to load 
the data page pointer register (DP), or any other register with the eight MSBs 
of a relocatable address. See the LDP instruction and subsection 10.3.2. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 



10-90 



Assembly Language Instructions 



Load Integer LDI 



Example ldi *-ari (IRO) ,R5 

Before Instruction: 



AR1 = 2Ch 

IR0 = 5h 

R5 = 3C5h = 965 

Data at 27h = 26h = 38 

LUF LV UF N Z V C = 

After Instruction: 

AR1 = 2Ch 

IR0 = 5h 

R5 = 26h = 38 

Data at 27h = 26h = 38 

LUFLV UF N Z V C = 00 



10-91 



LDICOnd Load Integer Conditionally 



Syntax 


LD\cond src, dst 


Operation 


If conc/is true: 
src -¥ dst, 




Else: 
dst is unchanged. 


Operands 


src general addressing mode. 




register (Rn, < n < 27) 

1 direct 

1 indirect 
1 1 immediate 




dst register (Rn, < n < 27) 


Encoding 





31 



T 1 1 — 

10 1 



24 23 



t — i — r 

cond 



i — i — r 
dst 



1615 



i n r 



87 

i — i — r 



t — i — i — i — i — r 



src 



Description 



Cycles 
Status Bits 



If the condition is true, the src operand is loaded into the dst register. Other- 
wise, the dst register is unchanged. The dst and src operands are assumed 
to be signed integers. 

The TMS320C3x provides 20 condition codes that can be used with this in- 
struction (see Section 1 0.2 for a list of condition mnemonics, encoding, and 
flags). Note that an LDIU (load integer unconditionally) instruction is useful for 
loading R7 — R0 without affecting the condition flags. Condition flags are set 
on a previous instruction only when the destination register is one of the exten- 
ded-precision registers (R7-R0) or when one of the compare instructions 
(CMPF, CMPF3, CMPI, CMPI3, TSTB, or TSTB3) is executed. 

1 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



Mode Bit 



OVM Operation is not affected by OVM bit value. 



10-92 



Assembly Language Instructions 



Load Integer Conditionally LDICOnd 



Example ldiz R4,R6 

Before Instruction: 



R4 = 027Ch = 636 

R6 = 0FE2h = 4,066 

LUFLV UFNZ V C = 

After Instruction: 

R4 = 027Ch = 636 

R6 = 0FE2h = 4,066 

LUFLV UF N Z V C = 00 



10-93 



LDII Load Integer, Interlocked 



Syntax 


LDII src, dst 


Operation 


Signal interlocked operation. 
src -> dst 


Operands 


src general addressing modes (G) 
1 direct 
1 indirect 



Encoding 



31 



t — r~ 





dst register (Rn, < n < 27) 



24 23 



— 1 — I 1 1 T 

1 1 



i — i — r 
dst 



16 15 



87 



i — i — i — i — i — i — i — i — i — i — r 

src 



i — i — i — r 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



The srcoperand is loaded into the dst register. An interlocked operation is sig- 
naled over XF0 and XF1 . The src and dst operands are assumed to be signed 
integers. Note that only the direct and indirect modes are allowed. Refer to 
Section 6.4 for detailed description. 



1 if XF = (see Section 6.4 on page 6-1 0) 

These condition flags are modified only if the destination register is R7 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 

LDII @985Fh,R3 

Before Instruction: 







R0. 



DP = 80 




R3= Oh 




Data at 80985Fh = ODCh 




LUF LV UF N Z V C = 


= 


After Instruction: 





DP = 80 

R3 = ODCH 

Data at 80985Fh = ODCh 

LUF LV UF N Z V C = 
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Assembly Language Instructions 



Parallel LDI and LDI LDI| | LDI 



Syntax 

Operation 

Operands 

Encoding 



31 



LDI src2, dst2 
|| LDI srd,dst1 

src2 -> dst2 
|| srd -> dstl 

srd indirect (disp = 0, 1, IRO, IR1) 

dstl register (Rn1 , < ni < 7) 

src2 indirect (disp = 0, 1 , IRO, IR1) 

dst2 register (Rn2, < n2 < 7) 



i — 
1 1 



— T — I — I — r - 

11 



24 23 



dst2 



t — r 

dstl 



~i — r 





16 15 



t — r 



t — i — r 

srd 



87 



t — i — i — r 

src2 



Description 

Cycles 
Status Bits 



Mode Bit 



Two integer loads are performed in parallel. A warning is issued by the assem- 
bler if the LDIs load the same register. The result is that of LDI src2, dst2. 

1 



LUF 


Unaffected 


LV 


Unaffected 


UF 


Unaffected 


N 


Unaffected 


Z 


Unaffected 


V 


Unaffected 


c 


Unaffected 



OVM Operation is not affected by OVM bit value. 
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LDI 1 1 LDI Parallel LDI and LDI 



Example LDI *-ARl(l),R7 

I | LDI *AR7++(IR0) ,R1 

Before Instruction: 



AR1 = 809826h 

R7 = 0h 

AR7 = 8098C8h 

IR0 = 10h 

R1 =0h 

Data at 809825h = OFAh = 250 

Data at 8098C8h = 2EEh = 750 

LUFLV UF N Z V C = 00 

After Instruction: 

AR1 = 809826h 

R7 = OFAh = 250 

AR7 = 8098D8h 

IR0 = 10h 

R1=02EEh = 750 

Data at 809825h = OFAh = 250 

Data at 8098C8h = 2EEh = 750 

LUFLV UF N Z V C = 00 
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Parallel LDI and STI LDI 1 1 STI 



Syntax 

Operation 

Operands 

Encoding 



31 



LDI src2, dstl 
|| STI src3,dst2 

src2 -> dst 1 
|| src3-^dst2 

src2 indirect (disp = 0, 1, IRO, IR1) 

dstl register (Rn1 , < n1 < 7) 

src3 register (Rn2, < n2 < 7) 

dst2 indirect (disp = 0, 1, IRO, IR1) 



— i — 

1 1 



- 1 — i — i — r - 

110 1 



24 23 



t — r 

dstl 



1615 



I I 





T I 

src3 



T 1 1 1 — 

dst2 



t — i — r 



t — i — r — i — i — r— r 

src2 



Description 

Cycles 
Status Bits 



Mode Bit 



An integer load and an integer store are performed in parallel. If src2 and dst2 
point to the same location, src2 is read before the write to dst2. 

1 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



OVM Operation is not affected by OVM bit value. 
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LDIIjSTI Parallel LDi [and STI 



Example LDI *-ARl(l),R2 

I | STI R7,*AR5++(IR0) 

Before Instruction: 



AR1 = 8098E7h 

R2 = 0h 

R7 = 35h = 53 

AR5 = 80982Ch 

IR0 = 8h 

Data at 8098E6h = ODCh = 220 

Data at 80982Ch = Oh 

LUFLV UFNZ V C = 0'0 

After Instruction: 

AR1 = 8098E7h 

R2 = ODCh = 220 

R7 = 35h = 53 

AR5 = 809834h 

IRO = 8h 

Data at 8098E6h = ODCh = 220 

Data at 80982Ch = 35h = 53 

LUFLV UF N Z V C = 00 



1 0-98 Assembly Language Instructions 



Load Floating-Point Mantissa j-.PM 



Syntax 

Operation 

Operands 



Encoding 



Description 



Cycles 
Status Bits 



31 



1 — r~ 




LDM src, dst 

src (man) -* dst (man) 

src general addressing modes (G): 

register (Rn, < n < 7) 

1 direct 

1 indirect 
1 1 immediate 

dst register (Rn, 0<n<7) 



24 23 

— i — i — i — i — r~ 

1 1 



t — i — i — r 

dst 



16 15 



87 



i — i — i — i — i — i — i — i — r 

src 



i — r 



i — i — r 



The mantissa field of the src operand is loaded into the mantissa field of the 
dst register. The dst exponent field is not modified. The src and dst operands 
are assumed to be floating-point numbers. If the src operand is from memory, 
the entire memory contents are loaded as the mantissa. If immediate address- 
ing mode is used, bits 1 5 —1 2 of the instruction word are forced to by the as- 
sembler. 

1 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



Mode Bit 
Example 



OVM Operation is not affected by OVM bit value. 

LDM 156.75,R2 (156.75 = O71CC0OO00h) 

Before Instruction: 

R2 = Oh 

LUFLV UF N Z V C = 00 

After Instruction: 

R2 = 001 CCOOOOOh = 1 .22460938e + 00 
LUFLV UF N Z V C = 
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LDP Load Data Page Pointer 



Syntax 

Operation 

Operands 

Encoding 



31 



— i — r— 





LDP src, DP 

src-> Data page pointer 

src is the 8 MSBs of the absolute 24-bit source address (src). 
The ",DP" in the operand is optional. 

dst register (Rn, < n < 7) 



24 23 



— t — i — i — i — r 

1 00 



T" 
1 1 



— i — i — i — r 

10 



1615 



t — i — i — i — i — i — r 





87 



t — i — i — r 

src 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



This pseudo-op is an alternate form of the LDI instruction, except that LDP is 
always in the immediate addressing mode. The src operand field contains the 
eight MSBs of the absolute 24-bit srcaddress (essentially, only bits 23 — 16 of 
src are used). These eight bits are loaded into the eight LSBs of the data page 
pointer. 

The eight LSBs of the pointer are used in direct addressing as a pointer to the 
page of data being addressed. There is a total of 256 pages, each page 6.4K 
words long. Bits 31 — 8 of the pointer are reserved and should be kept to zero. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV Unaffected. 

UF Unaffected. 

N Unaffected. 

Z Unaffected. 

V Unaffected. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 

LDP @809900h, DP 

or 

LDP @809900h 

Before Instruction: 

DP = 65h 

LUFLV UF N Z V C = 00 

After Instruction: 

DP = 80h 

LUFLV UF N Z V C = 00000 
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Assembly Language Instructions 



Logical Shift LSH 



Syntax 


LSH count, dst 


Operation 


If count >0: 




dst « count -» dst 




Else: 




dst» \count\ -> dst 


Operands 


count general addressing modes (G) 




register (Rn, < n < 27) 




1 direct 




1 indirect 




1 1 immediate 



Encoding 



31 



t — r— 





dst register (Rn, < n < 27) 



24 23 

— 1 — I — I 1 T" 

1 1 1 



t — i — i — r 

dst 



16 15 87 

t — i — i — i — i — i — i — i — i — i — i — i — i — i — r 



count 



Description 



The seven least significant bits of the count operand are used to generate the 
twos-complement shift count. If the count operand is greater than zero, the dst 
operand is left-shifted by the value of the count operand. Low-order bits shifted 
in are zero-filled, and high-order bits are shifted out through the C (carry) bit. 

Logical left-shift: 
C <r- dst <- 

If the count operand is less than zero, the dst is right-shifted by the absolute 
value of the count operand. The high-order bits of the dst operand are zero- 
filled as shifted to the right. Low-order bits are shifted out through the C (carry) 
bit. 

Logical right-shift: 
0->dsf->C 

If the count operand is 0, no shift is performed and the C (carry) bit is set to 0. 
The count operand is assumed to be a signed integer and the dst operand is 
assumed to be an unsigned integer. 
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LSH Logical Shift 



Cycles 
Status Bits 



1 

These condition flags are modified only if the destination register is R7 — RO. 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 0. 

C Set to the value of the last bit shifted out. for a shift count of 0. 



Mode Bit 
Example 1 



Example 2 



OVM Operation is not affected by OVM bit value. 

LSH R4,R7 

Before Instruction: 

R4 = 018h = 24 
R7 = 02ACh 

LUFLV UF N Z V C = 00 

After Instruction: 

R4 = 018h = 24 

R7 = OACOOOOOOh 

LUFLV UF N Z V C = 1 1 

LSH *-AR5 (IR1) ,R5 

Before Instruction: 

AR5 = 809908h 

IR0 = 4h 

R5 = 0012C00000h 

Data at 809904h = 0FFFFFFF4h = -12 

LUFLV UF N Z V C = 00 

After Instruction: 

AR5 = 809908h 

IR0 = 4h 

R5 = 000001 2C00h 

Data at 809904h = 0FFFFFFF4h = -12 

LUFLV UFNZ'V C = 
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Assembly Language Instructions 



Logical Shift, 3-Operand LSH3 



Syntax 
Operation 



Operands 



Encoding 



31 



"I — T" 

1 



Description 



LSH3 count, src, dst 

If count > 0: 
src « cot/rrt -> dsf 

Else: 
src » /count / -» c/sf 

src three-operand addressing modes (T): 
register (Rn1 , < n < 27) 

1 indirect (disp = 0, 1 , IRO, IR1) 

1 register (Rn1 , < n1 < 27) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

courrt three-operand addressing modes (T): 
register (Rn2, < n2 < 27) 

1 register (Rn2, < n2 < 27) 
10 indirect (disp = 0, 1 , IRO, IR1) 

1 1 indirect (disp = 0, 1 , IRO, IR1 ) 

dst register (Rn, < n < 27) 



24 23 



~ i — i — i — i — r 

10 



16 15 



dst 



t — i — r 



t — i — i — i — i — r 

count 



The seven least significant bits of the count operand are used to generate the 
twos-complement shift count. 

If the count operand is greater than zero, the dst operand is left-shifted by 
the value of the count operand. Low-order bits shifted in are zero-filled, and 
high-order bits are shifted out through the C (carry) bit. 

Logical left-shift: 
C <r- src <- 

If the count operand is less than zero, the src operand is right-shifted by the 
absolute value of the count operand. The high-order bits of the dst operand are 
zero-filled as shifted to the right. Low-order bits are shifted out through the C 
(carry) bit. 

Logical right-shift: 
-> src -> C 

If the count operand is 0, no shift is performed and the C (carry) bit is set to 0. 
The count operand is assumed to be a signed integer. The src and dst oper- 
ands are assumed to be unsigned integers. 
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LSH3 



Logical Shift, 3-Operand 



Cycles 
Status Bits 



Mode Bit 
Example 1 



Example 2 



1 

These condition flags are modified only if the destination register is R7 — RO. 
LUF Unaffected. 
LV Unaffected. 
UF 0. 

N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 0. 

C Set to the value of the last bit shifted out. for a shift count of 0. 

Unaffected if dst is not R7 — RO. 

OVM Operation is not affected by OVM bit value. 

LSH3 R4,R7,R2 

Before Instruction: 

R4 = 018h = 24 

R7 = 02ACh 

R2 = 0h 

LUFLVUFNZVC = 00000 

After Instruction: 

R4 = 018h = 24 

R7 = 02ACh 

R2 = OACOOOOOOh 

LUFLV UF N Z V C = 1 10 

LSH3 *-AR4 (IR1) ,R5,R3 

Before Instruction: 

AR4 = 809908h 

IR1 =4h 

R5 = 012C00000h 

R3 = 0h 

Data at 809904h = 0FFFFFFF4h = -12 

LUFLV UF N Z V C = 00 

After Instruction: 

AR4 = 809908h 

IR1 =4h 

R5 = 012C00000h 

R3 = 000001 2C00h 

Data at 809904h = 0FFFFFFF4h = -12 

LUFLVUFNZVC = 000 00 
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Assembly Language Instructions 



PamllelLSH3andSTI v J^H3JJ?TI 



Syntax 


LSH3 count, src2, dstl 




|| STI src3, dst2 


Operation 


If count >0: 




src2« count-* dstl 




Else: 




src2» \count | -> dsf7 




|| src3^dst2 



Operands 



Encoding 



31 



coivnf register (Rn1 , < n1 < 7) 
srd indirect (disp = 0, 1, IRO, IR1) 
dst 1 register (Rn3, < n3 < 7) 
src2 register (Rn4, < n4 < 7) 
dst2 indirect (disp = 0, 1, IRO, IR1) 



T — 
1 1 



-i — i — i — r~ 

1110 



24 23 



i — r 
dstl 



i i 

count 



t — r 
src3 



16 15 



t — i — i — i — i — i — r 

dst2 



87 



i — i — i — i — i — i — r 

src2 



Description 



The seven least significant bits of the count operand are used to generate the 
twos-complement shift count. 

If the count operand is greater than zero, the dst operand is left-shifted by 
the value of the count operand. Low-order bits shifted in are zero-filled and 
high-order bits are shifted out through the C (carry) bit. 

Logical left-shift: 
C <- dst2 <- 

If the count operand is less than zero, the dst operand is right-shifted by the 
absolute value of the count operand. The high-order bits of the dst operand 
are zero-filled as shifted to the right. Low-order bits are shifted out through the 
C (carry bit). 

Logical right-shift: 

-» dst2 -» C 

If the count operand is 0, no shift is performed and the carry bit is set to 0. 

The count operand is assumed to be a 7-bit signed integer, and the src2 and 
dstl operands are assumed to be unsigned integers. All registers are read at 
the beginning and loaded at the end of the execute cycle. This means that 
if one of the parallel operations (STI) reads from a register and the operation 
being performed in parallel (LSH3) writes to the same register, then STI ac- 
cepts as input the contents of the register before it is modified by the LSH3. 

If src2 and dst2 point to the same location, src2 is read before the write to dst2. 
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LSH3IISTI Parallel LSH3 and STI 



Cycles 
Status Bits 



Mode Bit 
Example 



1 

These condition flags are modified only if the destination register is R7 — RO. 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 0. 

C Set to the value of the last bit shifted out. for a shift count of 0. 

OVM Operation is affected by OVM bit value. 

LSH3 R2,*++AR3 (1) , RO 
| | STI R4, *-AR5 

Before Instruction: 



R2 = 18h = 24 




AR3 = 8098C2h 




R0 = 0h 




R4 = ODCh = 220 




AR5 = 8098A3h 




Data at 8098C3h = OACh 




Data at 8098A2h = Oh 




LUF LV UF N Z V C = 





After Instruction: 





R2 = 18h = 24 

AR3 = 8098C3h 

R0 = OACOOOOOOh 

R4 = ODCh = 220 

AR5 = 8098A3h 

Data at 8098C3h = OACh 

Data at 8098A2h = ODCh = 220 

LUF LV UF N Z V C = 1 



1 
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Assembly Language Instructions 



Parallel LSH3 and STI LSH3IISTI 



Example LSH3 R7 , *AR2- - ( 1 ) , R2 

| | STI RO,*+ARO (1) 

Before Instruction: 



R7 = 


0FFFFFFF4h 





12 


AR2 


= 809863h 






R2 = 


Oh 






R0 = 


12Ch = 300 






ARO 


= 8098B7h 






Data at 809863h = 


2C000000h 


Data at 8098B8h = 


Oh 




LUF 


LV UF N Z 


V 


C = 


After Instruction: 











R7 = 0FFFFFFF4h = -12 
AR2 = 809862h 
R2 = 2C000h 
R0 = 12Ch = 300 
ARO = 8098B7h 
Data at 809863h = 2C000000h 
Data at 8098B8h = 12Ch = 300 
LUF LVUFNZVC = 0000 
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MPYF 



Multiply Floating-Point 



Syntax 


MPYF src, dst 


Operation 


dst x src -^ dst 


Operands 


src general addressing modes (G) 




register (Rn, < n < 7) 




1 direct 




1 indirect 




1 1 immediate 



Encoding 



31 



1 — I - 





dst register (Rn, < n < 7) 



24 23 



"I — I 1 1 T 

1 10 



i — i — r 
dst 



16 15 



t — i — i — i — r 



87 

t — i — i — i — i — i — i — i — r 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The product of the dst and src operands is loaded into the dst register. The src 
operand is assumed to be a single-precision floating-point number, and the dst 
operand is an extended-precision floating-point number. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

UF 1 if a floating-point underflow occurs, otherwise. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point is overflow occurs, otherwise. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 

MPYF R0,R2 

Before Instruction: 

R0 = 070C800000h = 1 .4050e + 02 
R2 = 034C200000h = 1 .275781 25e + 01 

LUFLV UF N Z V C = 00000 

After Instruction: 

R0 = 070C800000h = 1 .4050e + 02 

R2 = 0A600F2000h = 1 .79247266e + 03 

LUFLV UF N Z V C = 00 
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Assembly Language Instructions 



Multiply Floating-Point, 3-Operand MPYF3 



Syntax 

Operation 

Operands 



Encoding 



31 



— I — I - 

1 



MPYF3 src2, srd, dst 

srd x src2 -> dst 

srd three-operand addressing modes (T): 
register (Rn1, 0<n1 <7) 

1 indirect (disp = 0, 1, IRO, IR1) 

1 register (Rn1, 0<n1 <7) 

11 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, < n2 < 7) 

1 register (Rn2, < n2 < 7) 

10 indirect (disp = 0, 1, IRO, IR1) 

1 1 indirect (disp = 0, 1 , IRO, IR1 ) 

dst register (Rn, < n < 7) 



24 23 



16 15 



87 



- 1 — i — i — i — r - 

10 1 



t — r 

dst 



t — i — i — i — r 

srd 



t — r 



i — i — i — i — r 

src2 



Description 

Cycles 
Status Bits 



Mode Bit 



The product of the dstl and src2 operands is loaded into the dst register. The 
srd and src2operands are assumed to be single-precision floating-point num- 
bers, and the dst operand is an extended-precision floating-point number. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

UF 1 if a floating-point underflow occurs, otherwise. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point is overflow occurs, otherwise. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 
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MPYF3 Multiply Floating-Point, 3-Operand 



Example 1 



MPYF3 R0,R7,R1 

Before Instruction: 

RO = 057B400000h = 6.281 250e + 01 

R7 = 0733C00000h = 1 .79750e + 02 

R1=0h 

LUFLV UF N Z V C = 00 



Example 2 



After Instruction: 

RO = 057B400000h = 6.281 250e + 01 

R7 = 0733C00000h = 1 .79750e + 02 

R1 = 0D306A3000h = 1 .1 2905469e + 04 

LUF LV UF N Z V C = 

MPYF3 *+AR2 (IR0) ,R7,R2 

or 

MPYF3 R7,*+AR2 (IRO) , R2 

Before Instruction: 

AR2 = 809800h 

IR0 = 12Ah 

R7 = 057B400000h = 6.281 250e + 01 

R2 = 0h 

Data at 80992Ah = 70C8000h = 1 .4050e + 02 

LUFLV UF N Z V C = 

After Instruction: 

AR2 = 809800h 

IR0 = 12Ah 

R7 = 057B400000h = 6.281 250e + 01 

R2 = 0D09E4A000h = 8.8251 5625e + 03 

Data at 80992Ah = 70C8000h = 1 .4050e + 02 

LUF LV UF N Z V C = 
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Assembly Language Instructions 



Parallel MPYF3 and ADDF3 MPYF3|J ADDF3 



Syntax 


MPYF3 srcA, srcB, dstl 
|| ADDF3 srcC, srcD, dst2 


Operation 


srcA x srcB -> dstl 
|| srcC+ srcD-> dst2 


Operands 








srcA 
srcB 
srcC 
srcD 


Any two indirect (disp = 0,1 JR0.IR1 ) 
Any two register (0 < Rn < 7) 




dstl 


register (d1): 

= R0 

1 =R1 




dst2 


register {d2): 

= R2 

1 =R3 




srd 
src2 
src3 
src4 


- register (Rn, < n < 7) 
register (Rn, < n < 7) 
indirect (disp = 0, 1, IRO, IR1) 
indirect (disp = 0,1, IRO, IR1 ) 



P parallel addressing modes (0 < P < 3) 

Operation (P Field) 



Encoding 



31 



00 
01 
10 

11 



24 23 



src3 x src4, srd + src2 
src3 xsrd, src4 + src2 
srd x src2, src3 + src4 
src3 x srd, src2 + src4 



16 15 



87 



I " 

1 


1 1 1 




1 

p 


d1 


d2 


I 1 

srd 


l l 

src2 


1 1 1 1 1 1 1 

src3 


1 111 111 

src4 



Description 



A floating-point multiplication and a floating-point addition are performed in 
parallel. All registers are read at the beginning and loaded at the end of the ex- 
ecute cycle. This means that if one of the parallel operations (MPYF3) reads 
from a register and the operation being performed in parallel (ADDF3) writes 
to the same register, then MPYF3 accepts as input the contents of the register 
before it is modified by the ADDF3. 
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MPYF3JJ ADDF3 Parallel MPYF3 andADDF3 



Cycles 
Status Bits 



Mode Bit 
Example 



Any combination of addressing modes may be coded for the four possible 
source operands as long as two are coded as indirect and two are register. The 
assignment of the source operands srcA - srcDto the srd - src4\ ields varies, 
depending on the combination of addressing modes used, and the P field is 
encoded accordingly. The assembler may, when not significant, change the or- 
der of operands in commutative operations in order to simplify processing. 

If src2 and dst2 point to the same location, src2 is read before the write to dst2. 

1 

These condition flags are modified only if the destination register is R7 — RO. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

UF 1 if a floating-point underflow occurs, otherwise. 

N 0. 

Z 0. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 

MPYF3 *AR5++(1) ,*--ARl (IRO) , RO 
I | ADDF3 R5,R7,R3 



Before Instruction: 

AR5 = 8098C5h 

AR1 = 8098A8h 

IRO = 4h 

R0 = 0h 

R5 = 0733C00000h = 1 .79750e + 02 

R7 = 070C800000h = 1 .4050e + 02 

R3 = Oh 

Data at 8098C5h = 34C0000h = 1 .2750e + 01 

Data at 8098A4h = 111 OOOOh = 2.2500e + 00 

LUF LV UF N Z V C = 
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Assembly Language Instructions 



Parallel MPYF3 andADDF3 MPYF3 j J ADDF3 



After Instruction: 

AR5 = 8098C6h 

AR1 = 8098A4h 

IR0 = 4h 

RO = 04671 80000h = 2.888671 88e + 01 

R5 = 0733C00000h = 1 .79750e + 02 

R7 = 070C800000h = 1 .4050e + 02 

R3 = 0820200000h = 3.20250e + 02 

Data at 8098C5h = 34C0000h = 1 .2750e + 01 

Data at 8098A4h = 1110000h = 2.2500e + 00 

LUF LV UF N Z V C = 
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M P YF3 1 1 STF Parallel MPYF3 and STF 



Syntax 

Operation 

Operands 



MPYF3 
STF 



src2, srd, dst 
src3, dst2 



srd x src2-^ dstl 
|| src3-^dst2 

srd register (Rn1 , < n1 < 7) 

src2 indirect (disp = 0, 1, IRO, IR1) 

dst 1 register (Rn3, < n3 < 7) 

src3 register (Rn4, < n4 < 7) 

dst2 indirect (disp = 0, 1, IRO, IR1) 



Encoding 



31 



-T— 

1 1 



Description 



Cycles 
Status Bits 



Mode Bit 



—i — i — i — I - 

1111 



24 23 



16 15 



87 



t — r 
dstl 



i — r 
srd 



t — r 
src3 



t — i — i — i — i — i — r 

dst2 



t — i — i — i — i — r 

src2 



A floating-point multiplication and a floating-point store are performed in paral- 
lel. All registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (MPYF3) writes to a reg- 
ister and the operation being performed in parallel (STF) reads from the same 
register, then the STF accepts as input the contents of the register before it is 
modified by the MPYF3. 

If src2 and dst2 point to the same location, then src2 is read before the write 
to dst2. 



1 

These condition flags are modified only if the destination register is R7 ■ 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

UF 1 if a floating-point underflow occurs, otherwise. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 



R0. 
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Assembly Language Instructions 



Parallel MPYF3 and STF MPYF3I ISTF 



Example MPYF3 *-AR2 (1) ,R7, RO 

I | STF R3, *ARO (IRO) 

Before Instruction: 

AR2 = 80982Bh 

R7 = 057B400000h = 6.281 250e + 01 

R0 = 0h 

R3 = 086B280000h = 4.7031 250e + 02 

ARO = 809860h 

IRO = 8h 

Data at 80982Ah = 70C8000h = 1 .4050e + 02 

Data at 809860h = Oh 

LUFLV UF N Z V C = 00 

After Instruction: 

AR2 = 80982Bh 

R7 = 057B400000h = 6.281 250e + 01 

RO = 0D09E4A000h = 8.8251 5625e + 03 

R3 = 086B280000h = 4.7031250e + 02 . 

ARO = 809858h 

IR0 = 8h 

Data at 80982Ah = 70C8000h = 1 .4050e + 02 

Data at 809860h = 86B280000h = 4.7031 250e + 02 

LUF LV UF N Z V C = 
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MPYF3IISUBF3 



Parallel MPYF3 and SUBF3 



Syntax 

Operation 

Operands 



MPYF3 srcA, srcB, dstl 
SUBF3 srcC, srcD, dst2 

srcA x srcB -> dst 1 
srcD - srcC -> dst2 



srcA 
srcB 
srcC 
srcD 

dstl 



dst2 



srd 
src2 
src3 
src4 



Any two indirect (disp = 0,1, IR0JR1) 
Any two register (0 < Rn < 7) 



register (d1): 

= R0 

1 =R1 

register (d2): 

= R2 

1 =R3 

register (Rn, < ri < 7) 

register (Rn, < n < 7) 

indirect (disp = 0, 1, IRO, IR1) 

indirect (disp = 0, 1, IRO, IR1) 



P parallel addressing modes (0 < P < 3) 

Operation (P Field) 



Encoding 



31 



1 



— I 1 T - 

1 



00 
01 
10 
11 



24 23 



src3 x src4, srd - src2 
src3 x srd, src4 - src2 
srd x src2, src3 - src4 
src3 xsrd, src2 - src4 



16 15 



d1 



d2 



srd 



t — r 

src2 



t — r 



t — i — r 



87 



src3 



t — i — i — i — r 

src4 



Description 



A floating-point multiplication and a floating-point subtraction are performed in 
parallel. All registers are read at the beginning and loaded at the end of the ex- 
ecute cycle. This means that if one of the parallel operations (MPYF3) reads 
from a register, and the operation being performed in parallel (SUBF3) writes 
to the same register, then MPYF3 accepts as input the contents of the register 
before it is modified by the SUBF3. 
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Assembly Language Instructions 



Parallel MPYF3 and SUBF3 MPYF3||SUBF3 



Cycles 
Status Bits 



Mode Bit 
Example 



Any combination of addressing modes may be coded for the four possible 
source operands as long as two are coded as indirect and two are register. The 
assignment of the source operands srcA-srcDto the src) -src4fields varies, 
depending on the combination of addressing modes used, and the P field is 
encoded accordingly. The assembler may, when not significant, change the 
order of operands in commutative operations in order to simplify processing. 



1 

These condition flags are modified only if the destination register is R7 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

UF 1 if a floating-point underflow occurs, otherwise. 

N 0. 

Z 0. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 

MPYF3 R5,*++AR7 (IR1) , RO 
|| SUBF3 R7,*AR3--(1) ,R2 

or 

MPYF3 * + +AR7(IRl), R5 , RO 
|| SUBF3 R7,*AR3 (1),R2 

Before Instruction: 

R5 = 034C000000h = 1 .2750e + 01 

AR7 = 809904h 

IR1 =8h 

R0 = 0h 

R7 = 0733C00000h = 1 ,79750e + 02 

AR3 = 8098B2h 

R2 = 0h 

Data at 80990Ch = 1110000h = 2.250e + 00 

Data at 8098B2h = 70C8000h = 1 .4050e + 02 

LUFLV UF N Z V C = 00 



R0. 
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MPYF3|]SUBF3 Parallel MPYF3 and SUBF3 



After Instruction: 



R5 = 034C000000h = 1.2750e + 01 

AR7 = 80990Ch . 

IR1 =8h 

R0 = 04671 80000h = 2.888671 88e + 01 

R7 = 0733C00000h = 1 .79750e + 02 

AR3 = 8098B1h 

R2 = 05E3000000h = - 3.9250e + 01 

Data at 80990Ch = 1110000h = 2.250e + 00 

Data at 8098B2h = 70C8000h = 1 .4050e + 02 

LUFLV UF N Z V C = 
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Multiply Integer MPYI 



Syntax 

Operation 

Operands 



Encoding 



31 



T — |— 




MPYI src, dst 
dst x src -> dst 

src general addressing modes (G): 

register (Rn, < n < 27) 

1 direct 

1 indirect 
1 1 immediate 

dst register (Rn, < n < 27) 



24 23 



16 15 



I I I I I 

1 10 1 



t I i r 

dst 



87 

t — i — i — i — i — i — i — i — i — i — i — i — r 



src 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



The product of the dst and src operands is loaded into the dst register. The src 
and dst operands, when read, are assumed to be 24-bit signed integers. The 
result is assumed to be a 48-bit signed integer. The output to the dst register 
is the 32 least significant bits of the result. 

Integer overflow occurs when any of the most significant 16 bits of the 48-bit 
result differs from the most significant bit of the 32-bit output value. 



1 

These condition flags are modified only if the destination register is R7 

LUF Unchanged. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C Unaffected. 

OVM Operation is affected by OVM bit value. 

MPYI R1,R5 

Before Instruction: 



R0. 



R1 = 000033C251h = 3,392,081 
R5 = 000078B600h = 7,91 0,912 
LUF LV UF N Z V C = 

After Instruction: 







R1 = 000033C251 h = 3,392,081 
R5 = 00E21 D9600h = - 501 ,377,536 
LUF LV UF N Z V C = 1 1 



1 
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MPYI3 Multiply Integer, 3-Operand 



Syntax 

Operation 

Operands 



Encoding 



31 



t — r— 

1 



MPYI3 src2, srd, dst 

srd x src2 -> dst 

srd three-operand addressing modes (T): 
register (Rn1, n1 <27) 

1 indirect (disp = 0, 1, IRO, IR1) 

1 register (Rn1,<n1 <27) 

1 1 indirect (disp = 0, 1 , IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, < n2 < 27) 

1 register (Rn2, < n2 < 27) 

1 indirect (disp = 0, 1 , IRO, IR1 ) 
1 1 indirect (disp = 0, 1 , IRO, IR1 ) 

dst register (Rn, < n < 27) 



24 23 



- 1 — i — i — i — r 

10 1 



t — i — r 
dst 



16 15 



t — r 



i i r 

srd 



i — r 



87 



t — i — i — i — i — r 

src2 



Description 



Cycles 
Status Bits 



The product of the srd and src2 operands is loaded into the dst register. The 
srd and src2 operands are assumed to be 24-bit signed integers. The result 
is assumed to be a signed 48-bit integer. The output to the dst register is the 
32 least significant bits of the result. 

Integer overflow occurs when any of the most significant 1 6 bits of the 48-bit 
result differs from the most significant bit of the 32-bit output value. 



Mode Bit 



1 

These condition flags are modified only if the destination register is R7 

LUF Unchanged. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C Unaffected. 

OVM Operation is affected by OVM bit value. 



R0. 
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Assembly Language Instructions 



Multiply Integer, 3-Operand MPYI3 



Example 1 



Example 2 



MPYI3 *AR4, *-ARl (1) , R2 

Before Instruction: 

AR4 = 809850h 

AR1 = 8098F3h 

R2 = 0h 

Data at 809850h = OADh = 173 

Data at 8098F2h = ODCh = 220 

LUFLV UF N Z V C = 00 

After Instruction: 

AR4 = 809850h 

AR1 = 8098F3h 

R2 = 094ACh = 38,060 

Data at 809850h = OADh = 173 

Data at 8098F2h = ODCh = 220 

LUFLV UF N Z V C = 00 

MPYI3 *— AR4 (IRO) , R2 , R7 

Before Instruction: 

AR4 = 8099F8h 

IR0 = 8h 

R2 = 0C8h = 200 

R7 = Oh 

Data at 8099F0h = 32h = 50 

LUFLV UF N Z V C = 00 

After Instruction: 

AR4 = 8099F0h 

IR0 = 8h 

R2 = 0C8h = 200 

R7 = 02710h = 10,000 

Data at 8099F0h = 32h = 50 

LUFLV UF N Z V C = 00 
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MPYi3||ADDI3 


Parallel MPY13 andADDI3 


Syntax 


MPYI3 srcA, srcB, dstl 
|| ADDI3 srcC, srcD, dst2 


Operation 


srcAxsrcB-* dstl 




|| srcD + srcC -> dst2 


Operands 








srcA 
srcB 
srcC 
srcD 


Any two indirect (disp = 0,1, IR0,IR1) 
Any two register (0 < Rn < 7) 




dstl 


register (d1): 

= R0 

1 =R1 




dst2 


register (d2): 

= R2 

1 =R3 




srd 


register (Rn, < n < 7) 




src2 
src3 
src4 


register (Rn, < n < 7) 
indirect (disp = 0, 1, IRO, IR1) 
indirect (disp = 0, 1, IRO, IR1) 



P parallel addressing modes (0 < P < 3). 

Operation (P Field) 



Encoding 



31 



1 



_, , — ! — 

10 



00 
01 
10 
11 



24 23 



d1 



d2 



t — r 

srd 



src3 x src4, srd + src2 
src3 x srd, src4 + src2 
srd x src2, src3 + src4 
src3 x srd, src2 + src4 



1615 



i — r 
src2 



t — i — i — i — i — r 

src3 



i — i — i — i — i — i — r 

src4 



Description 



An integer multiplication and an integer addition are performed in parallel. All 
registers are read at the beginning and loaded at the end of the execute cycle. 
This means that if one of the parallel operations (MPYI3) reads from a register 
and the operation being performed in parallel (ADDI3) writes to the same regis- 
ter, then MPYI3 accepts as input the contents of the register before it is modi- 
fied by the ADDI3. 
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Assembly Language Instructions 



Parallel MPYI3 andADDI3 MPYI3| | ADD13 



Cycles 
Status Bits 



Mode Bit 
Example 



Any combination of addressing modes may be coded for the four possible 
source operands as long as two are coded as indirect and two are register. The 
assignment of the source operands srcA - srcDtothe srd - src4i ields varies, 
depending on the combination of addressing modes used, and the P field is 
encoded accordingly. The assembler may, when not significant, change the or- 
der of operands in commutative operations in order to simplify processing. 

1 

These condition flags are modified only if the destination register is R7 — RO. 

LUF Unchanged. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 0. 

Z 0. 

V 1 if an integer overflow occurs, otherwise. 

C Unaffected. 

OVM Operation is affected by OVM bit value. 



MPYI3 
ADD 1 3 



R7,R4,R0 
*-AR3, *AR5- 



-(D,R3 



Before Instruction: 



R7 = 14h = 20 












R4 = 64h = 100 












RO = Oh 












AR3 = 80981 Fh 












AR5 = 80996Eh 












R3 = 0h 












Data at 80981 Eh = 


OFFFFFFCBh : 


= — 


53 




Data at 80996Eh = 


35h = 53 










LUF LV UF N Z 


V C = 














After Instruction: 













R7 = 14h = 20 

R4 = 64h = 100 

R0 = 07D0h = 2000 

AR3 = 80981 Fh 

AR5 = 80996Dh 

R3 = Oh 

Data at 80981 Eh = OFFFFFFCBh = - 53 

Data at ,80996Eh = 35h = 53 

LUFLV UF N Z V C = 00 
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MPYI3HSTI 

iissatsssamtsasosssSi&sasmssm' 


^^JS^S^M^^^J! 3 


^^^^ 


__» 


BtammmmmmiHMmBBmmimmm, 


««_ 


___ 


_ 


Syntax 


MPYI3 src2, srd, dstl 
II STI src3, dst2 








Operation 


srd x src2 -> dstl 
|| src3->dst2 








Operands 


srd register (Rn1 , < n1 < 7) 
src2 indirect (disp = 0, 1, IRO, IR1) 
dstl register (Rn3, < n3 < 7) 
src3 register (Rn4, < n4 < 7) 
dst2 indirect (disp = 0,1, IRO, IR1 ) 








Encoding 












31 


24 23 16 15 8 7 









1 1 


1 


i i 1 




i i 

dstl 


i i 

srd 


i i 

src3 


1 1 1 1 1 1 I™ 

dst2 


l l l l — I — I — r— 

src2 


Description 






An intec 


ier multip 


Dlication 


and an i 


iteaer store are oerfor 


med ir 


i narallel. / 


til ran 



Cycles 
Status Bits 



isters are read at the beginning and loaded at the end of the execute cycle. This 
means that if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (MPYI3) writes to the same register, then 
STI accepts as input the contents of the register before it is modified by the 
MPYI3. 

If src2 and dst2 point to the same location, src2\s read before the write to dst2. 

Integer overflow occurs when any of the most significant 1 6 bits of the 48-bit 
result differs from the most significant bit of the 32-bit output value. 



1 



These condition flags are modified only if the destination register is R7 
LUF Unchanged. 

1 if an integer overflow occurs, unchanged otherwise. 

0. 

1 if a negative result is generated, otherwise. 

1 if a zero result is generated, otherwise. 

1 if an integer overflow occurs, otherwise. 

Unaffected. 



R0. 



LV 
UF 
N 

Z 
V 

c 



Mode Bit 



OVM Operation is affected by OVM bit value. 
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Assembly Language Instructions 



Parallel MPYI3 and STI3 MPYI3] JSTI 



Example MPYI3 *++ARO (1) ,R5,R7 

I | STI R2, *-AR3 (1) 

Before Instruction: 



ARO = 80995Ah 

R5 = 32h = 50 

R7 = 0h 

R2 = ODCh = 220 

AR3 = 80982Fh 

Data at 80995Bh = 0C8h = 200 

Data at 80982Eh = Oh 

LUFLV UF N Z V C = 00 

After Instruction: 

ARO = 80995Bh 

R5 = 32h = 50 

R7 = 271 Oh = 10000 

R2 = ODCh = 220 

AR3 = 80982Fh 

Data at 80995Bh = 0C8h = 200 

Data at 80982Eh = ODCh = 220 

LUFLV UF N Z V C = 00 
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MPYI3[|SUBI3 


Parallel MPYI3 and SUBI3 


Syntax 


MPYI3 srcA, srcB, dstl 
|| SUBI3 srcC, srcD, dst2 


Operation 


srcA x srcB -> dstl 
|| srcD - srcC -> dst2 


Operands 








srcA 
srcB 
srcC 
srcD 


Any two indirect (disp = 0,1, IR0.IR1) 
Any two register (0 < Rn < 7) 




dstl 


register (d1): 

= R0 

1 =R1 




dst2 


register (d2)\ 

= R2 

1 =R3 




srd 
src2 
src3 
src4 


register (Rn, < n < 7) 
register (Rn, < n < 7) 
indirect (disp = 0, 1, IRO, IR1) 
indirect (disp = 0, 1, IRO, IR1) 



P parallel addressing modes (0 < P < 3) 

Operation (P Field) 



Encoding 









00 
01 
10 

11 




src3 x src4, srd - src2 
src3 x srd, src4 - src2 
srd x src2, src3 - src4 
src3 xsrd, src2 - src4 








31 


24 23 




1615 87 







1 

1 


l l l 

11 


i 
P 


d1 


d2 


I i 

srd 


I i 

src2 


I I I 1 I I 

src3 


i i i i i i i 

src4 



Description 



An integer multiplication and an integer subtraction are performed in parallel. 
All registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (MPYI3) reads from a 
register and the operation being performed in parallel (SUBI3) writes to the 
same register, then MPYI3 accepts as input the contents of the register before 
it is modified by the SUBI3. 
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Assembly Language Instructions 



Parallel MPYI3 and SUBI3 MPYI3||SUB13 



Cycles 
Status Bits 



Mode Bit 
Example 



Any combination of addressing modes may be coded for the four possible 
source operands as long as two are coded as indirect and two are register. The 
assignment of the source operands srcA - srcDtothe srd - src4\ ields varies, 
depending on the combination of addressing modes used, and the P field is 
encoded accordingly. The assembler may, when not significant, change the or- 
der of operands in commutative operations in order to simplify processing. 

Integer overflow occurs when any of the most significant 1 6 bits of the 48-bit 
result differs from the most significant bit of the 32-bit output value. 

1 

These condition flags are modified only if the destination register is R7 — RO. 

LUF Unchanged. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 1 if an integer underflow occurs, otherwise. 

N 0. 

Z 0. 

V 1 if an integer overflow occurs, otherwise. 

C Unaffected. 

OVM Operation is affected by OVM bit value. 

MPYI3 R2,*++AR0 (1) , RO 
| | SUBI3 *AR5--(IR1) , R4 , R2 

or 

MPYI3 *++AR0 (1) ,R2,R0 
| | SUBI3 *AR5 (IR1),R4,R2 

Before Instruction: 

R2 = 32h = 50 

ARO = 8098E3h 

RO = Oh 

AR5 = 8099FCh 

IR1=0Ch 

R4 = 07D0h = 2000 

Data at 8098E4h = 62h = 98 

Data at 8099FCh = 4B0h = 1200 

LUFLVUFNZVC = 0000 
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MPYI3||SUBI3 Parallel MPYI3 and SUBI3 



After Instruction: 



R2 = 320h = 800 

ARO = 8098E4h 

R0 = 01324h = 4900 

AR5 = 8099F0h 

IR1 =0Ch 

R4 = 07D0h = 2000 

Data at 8098E4h = 62h = 98 

Data at 8099FCh = 4B0h = 1 200 

LUFLV UF N Z V C = 00 
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Nega tive In teg'er With Borrow N E G B 



Syntax 


NEGB src, dst 


Operation 


- src - C -> dst 


Operands 


src general addressing modes (G): 




register (Rn, < n < 27) 




1 direct 




1 indirect 




1 1 immediate 



Encoding 



31 



—i — r~ 





dst register (Rn, 0<n<27) 



24 23 



16 15 



I I I I l 

1 110 



i — i — r 

dst 



i — i — i — i — i — i — r 



-| — i — i — i — r 

src 



i — i — r 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The difference of the 0, src, and C operands is loaded into the dst register. The 
dst and src are assumed to be signed integers. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 

OVM Operation is affected by OVM bit value. 

NEGB R5,R7 

Before Instruction: 



1 



R5 = 0FFFFFFCBh = -53 

R7 = 34h = 52 

LUF LV UF N Z V C = 1 



R5 = 


OFFFFFFCBh 





-53 




R7 = 


Oh 








LUF 


LV UF N Z 


V 


C = 





After Instruction: 
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NEGF Negate Floating-Point 



Syntax 


NEGF src, dst 


Operation 


- src -» dst 


Operands 


src general addressing modes (G): 




register (Rn, < n < 7) 




1 direct 




1 indirect 




1 1 immediate 



Encoding 



Description 

Cycles 
Status Bits 



31 



t — r - 





dst register (Rn, < n < 7) 



24 23 



— i — i — i — i — r 

1 11 1 



i — i — r 

dst 



16 15 



t — i — r 



87 

t — i — i — i — i — i — i — i — i — i — r 



src 



The difference of the and src operands is loaded into the dst register. The 
dst and src operands are assumed to be floating-point numbers. 



1 

These condition flags are modified only if the destination register is R7 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

UF 1 if a floating-point underflow occurs, otherwise. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 



R0. 



Mode Bit 
Example 



OVM Operation is not affected by OVM bit value. 

NEGF *++AR3 (2) ,R1 

Before Instruction: 

AR3 = 809800h 

R1 = 057B400025h = 6.281 25006e + 01 

Data at 809802h = 70C8000h = 1 .4050e + 02 

LUFLV UF N Z V C = 00 

After Instruction: 

AR3 = 809802h 

R1 = 07F3800000h = -1 .4050e + 02 

Data at 809802h = 70C8000h = 1 .4050e + 02 

LUFLV UFNZ V C = 
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Assembly Language Instructions 



Parallel NEGF and STF N EG F 1 1 STF 



Syntax 

Operation 

Operands 

Encoding 



31 



NEGF src2,dst1 
|| STF src3, dst2 

- src2 -» dst 1 
|| src3->dst2 

src2 indirect (disp = 0, 1 , IRO, IR1 ) 

dstl register (Rn1 , < n1 < 7) 

src3 register (Rn2, < n2 < 7) 

dst2 indirect (disp = 0, 1 , IRO, IR1) 



24 23 



16 15 



1 1 



- 1 — i — i — r - 

10 1 



i — r 
dstl 



— i — i — 





i — r 

src3 



t — i — i — i — i — r 

dst2 



i — i — i — i — i — i — r 

src2 



Description 



Cycles 
Status Bits 



A floating-point negation and a floating-point store are performed in parallel. 
All registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STF) reads from a reg- 
ister and the operation being performed in parallel (NEGF) writes to the same 
register, then STF accepts as input the contents of the register before it is modi- 
fied by the NEGF. 

If src2 and dst2 point to the same location, src2 is read before the write to dst2. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF 

LV 

UF 

N 

Z 

V 

c 



1 if a floating-point underflow occurs, unchanged otherwise. 

1 if a floating-point overflow occurs, unchanged otherwise. 

1 if a floating-point underflow occurs, otherwise. 

1 if a negative result is generated, otherwise. 

1 if a zero result is generated, otherwise. 

1 if a floating-point overflow occurs, otherwise. 

Unaffected. 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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NEG Fl ISTF Parallel NEFG and STF 



Example NEGF *AR4--(1),R7 

| | STF R2, * + +AR5 (1) 

Before Instruction: 



AR4 = 8098E1h 

R7 = 0h 

R2 = 0733C00000h = 1 .79750e + 02 

AR5 = 809803h 

Data at 8098E1 h = 57B400000h = 6.281 250e + 01 

Data at 809804h = Oh 

LUFLV UF N Z V C = 

A fter Instruction: 

AR4 = 8098E0h 

R7 = 0584C00000h = - 6.281 250e + 01 

R2 = 0733C00000h = 1 .79750e + 02 

AR5 = 809804h 

Data at 8098E1 h = 57B4000h = 6.281 250e + 01 

Data at 809804h = 733C000h = 1 .79750e + 02 

LUFLV UF N Z V C = 00 
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Negate Integer NEGI 



Syntax 


NEGI src, dst 


Operation 


- src — » dst 


Operands 


src general addressing modes (G): 




register (Rn, < n < 27) 




1 direct 




1 indirect 




1 1 immediate 



Encoding 



31 



I I 





dst register (Rn, < n < 27) 



24 23 



I I I I I 

1 10 



t — r 



16 15 



87 



dst 



i — i — i — i — i — i — i — i — i — i — i — r 

src 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The difference of the and srcoperands is loaded into the dst register. The dst 
and srcoperands are assumed to be signed integers. 



1 

These condition flags are modified only if the destination register is R7 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 

OVM Operation is affected by OVM bit value. 

NEGI 174, R5 (174 = OAEh) 

Before Instruction: 

R5 = ODCh = 220 

LUFLV UF N Z V C = 00 

After Instruction: 

R5 = 0FFFFFF52 = -174 

LUF LV UF N Z V C = 1 1 



R0. 
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NEG I ] )STI Parallel NEGI and STI 



Syntax 

Operation 

Operands 

Encoding 



NEGI 
STI 



src2, dstl 
src3, dst2 



- src2 -> dstl 
|| src3^>dst2 

src2 indirect (disp = 0, 1, IRO, IR1) 

dstl register (Rn1 , < n1 < 7) 

src3 register (Rn2, < n2 < 7) 

dst2 indirect (disp = 0, 1, IRO, IR1) 



31 



24 23 



- 1 — 
1 1 



Description 



Cycles 
Status Bits 



Mode Bit 



— i — i — i — i— 

10 10 



i — r 
dstl 



— I — I — 




16 15 



src3 



t i i i i i r 

dst2 



87 



t — i — i — r 

src2 



An integer negation and an integer store are performed in parallel. All registers 
are read at the beginning and loaded at the end of the execute cycle. This 
means that if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (NEGI) writes to the same register, then 
STI accepts as input the contents of the register before it is modified by the 
NEGI. 

If src2 and dst2 point to the same location, src2 is read before the write to dst2. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 

OVM Operation is affected by OVM bit value. 
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Assembly Language Instructions 



Parallel NEGI and STI N EG 1 1 1 STI 



Example NEGI *-AR3,R2 • 

| | STI R2, *AR1++ 

Before Instruction: 



AR3 = 80982Fh 

R2 = 19h = 25 

AR1 = 8098A5h 

Data at 80982Eh = ODCh = 220 

Data at 8098A5h = Oh 

LUF LVUFNZVC = 00 

After Instruction: 

AR3 = 80982Fh 

R2 = 0FFFFFF24h = -220 

AR1 = 8098A6h 

Data at 80982Eh = ODCh = 220 

Data at 8098A5h = 1 9h = 25 

LUF LV UF N Z V C = 1 1 
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NOP No Operation 

Syntax 
Operation 

Operands 
Encoding 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 1 



Example 2 



NOP src 

No ALU or multiplier operations. 

ARn is modified if src is specified in indirect mode. 

src general addressing modes (G): 

register (no operation) 

1 indirect (modify ARn, < n < 7) 



31 




24 23 






16 15 


87 





I I 




I 1 

1 


i i i 

10 1 


1 

G 


1 




I I 




I i 1 I I I I I I 1 I 1 I I I 

src 



If the src operand is specified in the indirect mode, the specified addressing 
operation is performed and a dummy memory read occurs. If the src operand 
is omitted, no operation is performed. 

1 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



OVM Operation is not affected by OVM bit value. 

NOP 

Before Instruction: 

PC = 3Ah 

After Instruction: 

PC = 3Bh 

NOP *AR3--(1) 

Before Instruction: 

PC = 5h 

AR3 = 809900h 

After Instruction: 

PC = 6h 

AR3 = 8098FFh 
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Assembly Language Instructions 



Normalize NORM 



Syntax 


NORM src, dst 


Operation 


norm (src) -> dst 


Operands 


src general addressing modes (G) 




register (Rn, < n < 7) 




1 direct 




1 indirect 




1 1 immediate 



Encoding 



31 



t — r— 





—i — m — i— 

110 1 



24 23 



i — i — r 
dst 



16 15 



t — i — r 



87 



i — i — i — i — i — r 
src 



Description 



Cycles 
Status Bits 



The srcoperand is assumed to be an unnormalized floating-point number; i.e., 
the implied bit is set equal to the sign bit. The dst is set equal to the normalized 
srcoperand with the implied bit removed. The dst operand exponent is set to 
the srcoperand exponent minus the size of the left-shift necessary to normal- 
ize the src. The dst operand is assumed to be a normalized floating-point num- 
ber. 

If src (exp) = -1 28 and src ('man) = 0, then dst= 0, Z = 1 , and UF = 0. If src (exp) 
= -128 and src (man) * 0, then dst= 0, Z = 0, and UF = 1. For all other cases 
of the src, if a floating-point underflow occurs, then dst ('man) is forced to and 
dst (exp) = -1 28. If src (man) = 0, then dst (man) = and dst (exp) = -1 28. Re- 
fer to Section 4.6. 



Mode Bit 



1 

These condition flags are modified only if the destination register is R7 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

LV Unaffected. 

UF 1 if a floating-point underflow occurs, otherwise. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 



R0. 



10-137 



NORM Normalize 



Example 



NORM R1,R2 

Before Instruction: 

R1 = 0400003AF5h 
R2 = 070C800000h 
LUFLV UF N Z V C = 00 

After Instruction: 

R1 = 0400003AF5h 

R2 = F26BD40000h = 1.1 2451 61 3e - 04 

LUFLV UF N Z V C = 00 
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Assembly Language Instructions 



Syntax 

Operation 

Operands 



Encoding 



31 



t — r - 





Description 

Cycles 
Siatus Bits 



Mode Bit 
Example 



Bitwise Logical-Complement NOT 



NOT src, dst 

~src -» dst 

src general addressing modes (G): 

register (Rn, < n < 27) 

1 direct 

1 indirect 
1 1 immediate 

dst register (Rn, < n < 27) 



24 23 

~~i — i — i — i — r - 

1 10 1 1 



t — i — r 

dst 



16 15 



87 



t — i — i — i — i — i — i — i — i — i — i — i — i — r 

src 



The bitwise logical-complement of the src operand is loaded into the dst regis- 
ter. The complement is formed by a logical-NOT of each bit of the src operand. 
The dst and src operands are assumed to be unsigned integers. 



1 

These condition flags are modified only if the destination register is R7 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N MSB of the output. 

Z 1 if a zero result is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is affected by OVM bit value. 

NOT @982Ch,R4 

Before Instruction: 



R0. 



DP = 80h 




R4 = 0h 




Data at 80982Ch = 5E2Fh 




LUF LV UF N Z V C = 





After Instruction: 





DP = 80h 

R4 = 0FFFFA1D0h 

Data at 80982Ch = 5E2Fh 

LUF LV UF N Z V C = 1 
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NOT||STI Parallel NOT and STI 



Syntax 

Operation 

Operands 

Encoding 



31 



1 1 



Description 



Cycles 
Status Bits 



Mode Bit 



NOT src2, dstl 
|| STI src3, dst2 

~src2 -> dstl 
|| src3->dst2 

src2 indirect (disp = 0, 1, IRO, IR1) 

dstl register (Rn1 , < n1 < 7) 

src3 register (Rn2, < n2 < 7) 

dst2 indirect (disp = 0, 1, IRO, IR1) 



- i — i — i — r- 

10 11 



24 23 



16 15 



87 



dstl 



I l 





src3 



t i i i r 

dst2 



i i i r 

src2 



A bitwise logical-NOT and an integer store are performed in parallel. All regis- 
ters are read at the beginning and loaded at the end of the execute cycle. This 
means that if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (NOT) writes to the same register, then 
STI accepts as input the contents of the register before it is modified by the 
NOT. 

If src2 and dst2 point to the same location, src2 is read before the write to dst2. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N MSB of the output. 

Z 1 if a zero result is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 



10-140 



Assembly Language Instructions 



Parallel NOT and STI NOT| |STI 



Example NOT *+AR2,R3 

| | STI R7,*-- AR4 (IR1) 

Before Instruction: 



AR2 


= 8099CBh 






R3 = 


Oh 






R7 = 


ODCh = 220 






AR4 


= 809850h 






IR1 = 


= 10h 






Data at 8099CCh = 


: 0C2Fh 


Data at 809840h = 


Oh 




LUF 


LV UF N Z 


V 


C = 


After Instruction: 











AR2 = 8099CBh 

R3 = 0FFFFF3D0h 

R7 = ODCh = 220 

AR4 = 809840h 

IR1 =10h 

Data at 8099CCh = 0C2Fh 

Data at 809840h = ODCh = 220 

LUF LV UF N Z V C = 1 
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OR Bitwise Logical-OR 



Syntax 


OR src, dst 


Operation 


dst OR src -> dst 


Operands 


src general addressing modes (G): 




register (Rn, 0, < n < 27) 




1 direct 




1 indirect 




11 immediate (not sign-extended) 



Encoding 



Description 

Cycles 
Status Bits 



31 



—\ — r - 




Mode Bit 
Example 



dst register (Rn, < n < 27) 



24 23 



- 1 — i — i — i — r~ 

10 



t — i — r 

dst 



16 15 



87 



i — i — i — i — i — r 



t — i — r 

src 



t — i — i — i — r 



The bitwise logical OR between the src and dst operands is loaded into the dst 
register. The dst and src operands are assumed to be unsigned integers. 



1 

These condition flags are modified only if the destination register is R7 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N MSB of the output. 

Z 1 if a zero result is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 

OR *++ARl (IR1) ,R2 

Before Instruction: 



R0. 



AR1 


= 809800h 








IR1 = 


= 4h 








R2 = 


012560000h 








Data at 809804h = 


2BCDh 




LUF 


LV UF N Z 


V 


C = 





After Instruction: 













AR1 = 809804h 

IR1 =4h 

R2 = 012562BCDh 

Data at 809804h = 2BCDh 

LUF LV UF N Z V C = 
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Assembly Language Instructions 



Bitwise Logical-OR, 3-Operand 0R3 



Syntax 

Operation 

Operands 



Encoding 



31 



"n — r- 

1 



0R3 src2, srd, dst 

srd OR src2 -> cfe? 

src7 three-operand addressing modes (T): 
register (Rn1, n1 <27) 

1 indirect (disp = 0, 1, IRO, IR1) 
10 register (Rn1, < n1 < 27) 

1 1 indirect (disp = 0, 1 , IRO, IR1 ) 

src2 three-operand addressing modes (T): 
register (Rn2, < n2 < 27) 

1 register (Rn2, < n2 < 27) 
10 indirect (disp = 0,1, IRO, 1R1 ) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, < n < 27) 



24 23 



16 15 



- 1 — i — i — i — r— 

10 1 1 



t — i — r 

dst 



T T 



t i r 

srd 



1 — T 



T 1 1 T 

src2 



t — r 



Description 

Cycles 
Status Bits 



Mode Bit 



The bitwise logical-OR between the srd and src2 operands is loaded into the 
dst register. The srd, src2, and dst operands are assumed to be unsigned inte- 
gers. 

1 

These condition flags are modified only if the destination register is R7 — RO. 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N MSB of the output. 

Z 1 if a zero result is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 
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0R3 Bitwise Logical-OR, 3-Operand 

Example 0R3 *++ARl ( IR1 ) , R2 , R7 

Before Instruction: 



AR1 =809800h 

IR1 =4h 

R2 = 012560000h 

R7 = Oh 

Data at 809804h = 2BCDh 

LUFLV UF N Z V C = 00 

After Instruction: 

AR1 = 809804h 

IR1 =4h 

R2 = 012560000h 

R7 = 012562BCDh 

Data at 809804h = 2BCDh 

LUFLV UF N Z V C = 00 
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Parallel 0R3 and STI 0R3I ISTI 



Syntax 



Operation 



Operands 



Encoding 



31 



0R3 src2, srd, dstl 
|| STI src3,dst2 

srd OR src2^ dstl 
| src3 -> dst2 

srd register (Rn1 , < n1 < 7) 
src2 indirect (disp = 0, 1 , IRO, IR1 ; 
dstl register (Rn2, < n2 < 7) 
src3 register (Rn3, < n3 < 7) 
dst2 indirect (disp = 0, 1, IRO, IRt 



i — 
1 1 



I i i i 

10 10 



24 23 

— i — r 

dstl 



16 15 



-\ — r 
srd 



t — r 
src3 



t — i — i — i — i — i — r 

dst2 



i — i — i — i — r 
src2 



t — r 



Cycles 
Status Bits 



Mode Bit 



A bitwise logical-OR and an integer store are performed in parallel. All registers 
are read at the beginning and loaded at the end of the execute cycle. This 
means that if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (OR3) writes to the same register, then 
STI accepts as input the contents of the register before it is modified by the 
OR3. 

If src2 and dst2 point to the same location, src2\s read before the write to dst2. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N MSB of the output. 

Z 1 if a zero result is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 
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OR3I ISTI Parallel OR3 and STI 



Example 0R3 *++AR2 , R5 , R2 

| | STI R6, *AR1-- 

Before Instruction: 



AR2 = 809830h 

R5 = 800000h 

R2 = 0h 

R6 = ODCh = 220 

AR1 = 809883h 

Data at 809831 h = 9800h 

Data at 809883h = Oh 

LUFLV UF N Z V C = 00 

After Instruction: 

AR2 = 809831 h 

R5 = 800000h 

R2 = 809800h 

R6 = ODCh = 220 

AR1 = 809882h 

Data at 809831 h = 9800h 

Data at 809883h = ODCh = 220 

LUFLV UF N Z V C = 00 
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POP Integer POP 



Syntax 


POP dst 


Operation 


*SP >dst 


Operands 


dst register (Rn, < n < 27) 


Encoding 





31 




24 23 






1615 87 


i i 




1 1 

1 


1 1 1 

110 


1 

1 


1 I I I 

dst 


1 I I I I 1 1 1 1 1 1 I I I 1 

0000000000000000 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



The top of the current system stack is popped and loaded into the dst register 
(32 LSBs). The top of the stack is assumed to be a signed integer. The POP 
is performed with a postdecrement of the stack pointer. The exponent bits of 
an extended precision register (R7-R0) are left unmodified. 



1 

These condition flags are modified only if the destination register is R7 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 

POP R3 

Before Instruction: 

SP = 809856h 

R3 = 012DAh = 4,826 

Data at 809856h = 0FFFF0DA4h = - 62,044 

LUFLV UF N Z V C = 00 

After Instruction: 

SP = 809855h 

R3 = 0FFFF0DA4h = -62,044 

Data at 809856h = 0FFFF0DA4h = - 62,044 

LUF LV UF N Z V C = 1 



R0. 
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POPF [POP [Floating-Point 



Syntax 


POPF dst 


Operation 


*SP » dstl 


Operands 


dst register (Rn, < n < 7) 


Encoding 





31 


24 23 






1615 87 


i i 




I I I I I 

1 110 1 


1 

1 


I I 1 I 

dst 


I I I I I I I I i i i i i i i 

0000 000000000000 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



The top of the current system stack is popped and loaded into the dst register 
(32 MSBs). The top of the stack is assumed to be a floating-point number. The 
POP is performed with a postdecrement of the stack pointer. The 8 LSBs of 
an extended precision register (R7-R0) are zero filled. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

UF 0. 

LV Unaffected. 

N 1 if a negative result is generated, otherwise. - 

Z 1 if a zero result is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 

POPF R4 

Before Instruction: 

SP = 80984Ah 

R4 = 025D2E0123h = 6.91186578e + 00 

Data at 80984Ah = 5F2C1302h = 5.32544007e + 28 

LUFLV UF N Z V C = 

After Instruction: 



SP = 809849h 

R4 = 5F2C1 30200h = 5.32544007e + 28 

Data at 80984Ah = 5F2C1302h = 5.32544007e + 28 

LUFLVUFNZVC = 
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Assembly Language Instructions 



PUSH Integer PUSH 



Syntax 
Operation 
Operands 
Encoding 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



PUSH src 

src -» *++SP 

src register (Rn, < n < 27) 



31 




24 23 






16 15 


87 


I l 




1 1 

1 


1 1 1 

1110 


1 

1 


1 1 1 1 

src 


I I I I 




I I I I 1 I I I I I I 

000000000000 



The contents of the src register (32 LSBs) are pushed on the current system 
stack. The src is assumed to be a signed integer. The PUSH is performed with 
a preincrement of the stack pointer. The integer or mantissa portion of an ex- 
tended precision register (R7-R0) is saved with this instruction. 

1 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



OVM Operation is not affected by OVM bit value. 

PUSH R6 

Before Instruction: 



SP = 8098AEh 

R6 = 815Bh = 33,115 

Data at 8098AFh = Oh 

LUFLV UF N Z V C = 







After Instruction: 



SP = 8098AFh 

R6 = 815Bh = 33,115 

Data at 8098AFh = 815Bh = 33,115 

LUF LV UF N Z V C = 
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PUSHF PUSH Floating-Point 



Syntax 
Operation 
Operands 
Encoding 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



31 



I I 





PUSHF src 

src -» *++SP 

src register (Rn, < n < 7) 

24 23 



I I I I I 

1 1111 



1 



t — i — r 

src 



16 15 



87 



i — i — i — i — i — i — i — i — i — i — i — i — i — i — r~ 

0000 000000000 000 



The contents of the src register (32 MSBs) are pushed on the current system 
stack. The src is assumed to be a floating-point number. The PUSH is per- 
formed with a preincrement of the stack pointer. The 8 LSBs of the mantissa 
are not saved. (Note the difference in R2 and the value on the stack in the ex- 
ample below.) 

1 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



OVM Operation is not affected by OVM bit value. 

PUSHF R2 

Before Instruction: 



SP = 809801 h 

R2 = 025C1 28081 h = 6.87725854e + 00 

Data at 809802h = Oh 

LUF LV UF N Z V C = 







After Instruction: 



SP = 809802h 

R2 = 025C1 28081 h = 6.87725854e + 00 

Data at 809802h = 025C1280h = 6.87725830e + 00 

LUFLV UF N Z V C = 00 
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Assembly Language Instructions 



Return From Interrupt Conditionally RETIcond 



Syntax 


RETIcond 


Operation 


If cordis true: 




*SP >PC 




1 ~> ST (GIE). 




Else, continue. 


Operands 


None 


Encoding 





31 




24 23 






16 15 


87 


I I 

1 1 


I 

1 1 


I 1 1 




1 




1 1 1 1 

cond 


I I I I 




I I 1 I I I I I i I I 

000000000000 



Description 



A conditional return is performed. If the condition is true, the top of the stack 
is popped to the PC, and 1 is written to the global interrupt enable (GIE) bit of 
the status register. This has the effect of enabling all interrupts for which the 
corresponding interrupt enable bit is a 1 . 

The TMS320C3x provides 20 condition codes that can be used with this in- 
struction (see Section 10.2 for a list of condition mnemonics, encoding, and 
flags). Condition flags are set on a previous instruction only when the destina- 
tion register is one of the extended-precision registers (R7-R0) or when one 
of the compare instructions (CMPF, CMPF3, CMPI, CMPI3, TSTB, or TSTB3) 
is executed. 



Cycles 
Status Bits 



Mode Bit 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



OVM Operation is not affected by OVM bit value. 
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RETIcond Return From Interrupt Conditionally 

Example RETINZ 

Before Instruction: 



PC = 456h 
SP = 809830h 
ST = 0h 
Dataat809830h = 123h 

LUFLV UF N Z V C = 00 

After Instruction: 

PC = 123h 

SP = 80982Fh 

ST = 2000h 

Data at 809830h = 123h 

LUFLV UF N Z V C = 00 



1 0-1 52 Assembly Language Instructions 



Return From Subroutine Conditionally RETScond 



Syntax 


RETScond 


Operation 


If cond is true: 




*SP >PC. 




Else, continue. 


Operands 


None 


Encoding 





31 




24 23 






16 15 


87 


I 1 

1 1 


I 

1 1 


I i I 

1 


1 




1 1 1 1 

cond 


I I 1 I 




1 1 1 i i i i i i i i 

0000000 00000 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



A conditional return is performed. If the condition is true, the top of the stack 
is popped to the PC. 

The TMS320C3x provides 20 condition codes that can be used with this in- 
struction (see Section 10.2 for a list of condition mnemonics, encoding, and 
flags). Condition flags are set on a previous instruction only when the destina- 
tion register is one of the extended-precision registers (R7-R0) or when one 
of the compare instructions (CMPF, CMPF3, CMPI, CMPI3, TSTB, orTSTB3) 
is executed. 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



OVM Operation is not affected by OVM bit value. 

RETSGE 

Before Instruction: 







PC = 123h 




SP = 80983Ch 




Data at 80983Ch = 456h 




LUF LV UF N Z V C = 


= 


After Instruction: 





PC = 456h 

SP = 80983Bh 

Data at 80983Ch = 456h 

LUF LV UF N Z V C = 
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RND Round Floating-Point 



Syntax 


RND src, dst 


Operation 


md( src) -> dst 


Operands 


src general addressing modes (G): 




register (Rn, < n < 7) 




1 direct 




1 indirect 




1 1 immediate 



Encoding 



31 



t — r - 





dst register (Rn, < n < 7) 



24 23 



i i — i — i — r 

10 10 



t — i — r 
dst 



16 15 



i — i — i — i — i — i — r 



~i — r 

src 



t — r 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



The result of rounding the src operand is loaded into the dst register.The src 
operand is rounded to the nearest single-precision floating-point value. If the 
src operand is exactly half-way between two single-precision values, it is 
rounded to the most positive of those values. 



1 

These condition flags are modified only if the destination register is R7 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

UF 1 if a floating-point underflow occurs, otherwise. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

OVM Operation is affected by OVM bit value. 

RND R5,R2 

Before Instruction: 



R0. 



R5 = 0733C1 6EEFh = 1 .79755599e + 02 
R2 = 0h 

LUFLV UF N Z V C = 00 

After Instruction: 

R5 = 0733C1 6EEFh = 1 .79755599e + 02 
R2 = 0733C16F00h = 1.79755600e + 02 
LUFLV UF N Z V C = 00000 



10-154 



Assembly Language Instructions 



Rotate Left ROL 



Syntax 
Operation 
Operands 
Encoding 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



31 



- 1 — r— 





ROL dst 

dst left-rotated 1 bit -> dst 
dst register (Rn, < n < 27) 



2423 

t — i — i — i — r 



10 11 



1 1 



t — i — i — r 

dst 



16 15 87 

~1 1 1 1 1 1 — I 1 1 1 1 1 1 1 I 

0000000000000000 



The contents of the dst operand are left-rotated one bit and loaded into the 
dst register. This is a circular rotate with the MSB transferred into the LSB. 

Rotate left: 

C < — r-dst-<- 



->■ 



1 

These condition flags are modified only if the destination register is R7 — RO. 
LUF Unaffected. 
LV Unaffected. 
UF 0. 

N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 0. 

C Set to the value of the bit rotated out of the high-order bit. Unaffected 

if dst is not R7 — R0. 

OVM Operation is not affected by OVM bit value. 

ROL R3 

Before Instruction: 

R3 = 80025CD4h 

LUFLV UFNZV C = 

After Instruction: 

R3 = 0004B9A9h 

LUF LV UF N Z V C = 1 
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ROLC Rotate Left Through Carry 



Syntax 
Operation 
Operands 
Encoding 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 1 



ROLC dst 

dst left-rotated 1 bit through carry bit -» dst 
dst register (Rn, < n < 27) 



31 


24 23 






1615 87 


I I 




I I 1 I 1 

10 10 


1 

1 1 


1 I 1 1 

dst 


I I 1 1 I I 1 1 i i i i i i i 

0000000000000000 



The contents of the dst operand are left-rotated one bit through the carry bit 
and loaded into the dst register. The MSB is rotated to the carry bit, at the same 
time the carry bit is transferred to the LSB. 

Rotate left through carry bit: 

* — C — < — dst -<- 



->- 



1 

These condition flags are modified only if the destination register is R7 — R0. 
LUF Unaffected. 
LV Unaffected. 
UF 0. 

N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 0. 

C Set to the value of the bit rotated out of the high-order bit. If c/sf is not 

R7 — R0, then C is shifted into the dst but not changed. 

OVM Operation is not affected by OVM bit value. 

ROLC R3 

Before Instruction: 



R3 = 00000420h 

LUF LV UF N Z V C 

After Instruction: 



1 



R3 = 000000841 h 

LUF LV UF N Z V C = 
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Assembly Language Instructions 



Rotate Left Through Carry ROLC 



Example 2 ROLC R3 

Before Instruction: 



R3 = 80004281 h 

LUFLV UF N Z V C = 00 

After Instruction: 

R3 = 00008502h 

LUFLV UF N Z V C = 1 
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ROR Rotate Right 



Syntax 
Operation 
Operands 
Encoding 



ROR dst 

dst right-rotated 1 bit through carry bit -> dst 
dst register (Rn, < n < 27) 



31 



1 — i - 

000 



24 23 



Description 



— 1 — 1 — 1 — 1 — r 

10 0101 



1 1 



t — i — r 

dst 



16 15 



87 



~i — 1 — 1 — 1 — 1 — 1 — 1 — i — i — i — i — i — 1 — 1 — r~ 

0000000000000000 



The contents of the dst operand are right-rotated one bit and loaded into the 
dst register. The LSB is rotated into the carry bit and also transferred into the 
MSB. 



Rotate right: 

^- dst 

$ — 



■> C 



Cycles 
Status Bits 



Mode Bit 
Example 



1 

These condition flags are modified only if the destination register is R7 — R0. 
LUF Unaffected. 
LV Unaffected. 
UF 0. 

N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 0. 

C Set to the value of the bit rotated out of the high-order bit. Unaffected 

if dst is not R7 — R0. 

OVM Operation is not affected by OVM bit value. 

ROR R7 

Before Instruction: 

R7 = 00000421 h 

LUFLV UF N Z V C = 00 

After Instruction: 

R7 = 8000021 Oh 

LUF LV UF N Z V C = 1 1 
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Assembly Language Instructions 



Syntax 
Operation 
Operands 
Encoding 



Rotate Right Through Carry RORC 



RORC dst 

dst right- rotated 1 bit through carry bit -» dst 
dst register (Rn, < n < 27) 



31 




24 23 






16 15 






87 












1 1 

1 


II I 

110 


I 

1 1 


riii 

dst 


i 
1 1 


1 

1 


1 

1 


1 1 I 1 I 

111111 


1 

1 


I 1 

1 1 1 


I 

1 1 



Description 



The contents of the dst operand are right-rotated one bit through the status reg- 
ister's carry bit. This could be viewed as a 33-bit shift. The carry bit value is ro- 
tated into the MSB of the dst, while at the same time, the dst LSB is rotated into 
the carry bit. 

Rotate right through carry bit: 



C — ^ — dst — * 



Cycles 
Status Bits 



Mode Bit 
Example 



1 

These condition flags are modified only if the destination register is R7 — R0. 
LUF Unaffected. 
LV Unaffected. 
UF 0. 

N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 0. 

C Set to the value of the bit rotated out of the high-order bit. If dst is not 

R7 — R0, then C is shifted in but not changed. 

OVM Operation is not affected by OVM bit value. 

RORC R4 

Before Instruction: 

R4 = 80000081 h 

LUF LV UF N Z V C = 1 

After Instruction: 

R4 = 40000040h 

LUFLV UF N Z V C = 1 
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RPTB Repeat Block 

Syntax 
Operation 

Operands 
Encoding 



Description 



RPTB src 

src -^ RE 
1 -> ST (RM) 
Next PC -> RS 

src long-immediate addressing mode 



31 


24 23 


16 15 




87 





I i I 

1 1 


I I 1 I 

10 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 

src 



Cycles 
Status Bits 



Mode Bit 
Example 



RPTB allows a block of instructions to be repeated a number of times without 
any penalty for looping. This instruction activates the block repeat mode of up- 
dating the PC. The src operand is a 24-bit unsigned immediate value that is 
loaded into the repeat end address (RE) register. A 1 is written into the repeat 
mode bit of status register St (RM) to indicate that the PC is being updated in 
the repeat mode. The address of the next instruction is loaded into the repeat 
start address (RS) register. 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



OVM Operation is not affected by OVM bit value. 

RPTB 127h 

Before Instruction: 

PC = 123h 
ST = Oh 
RE = 0h 
RS = 0h 

LUFLV UF N Z V C = 00 

After Instruction: 

PC = i24h 
ST=100h 
RE = 127h 
RS = 124h 
LUFLV UF N Z V C = 00 
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Assembly Language Instructions 



Repeat Single RPTS 



Syntax 


RPTS src 


Operation 


src -> RC 




1 -» ST (RM) 




1 ->S 




Next PC -> RS 




Next PC -» RE 


Operands 


src general addressing modes (G) 




register 




1 direct 




1 indirect 




1 1 immediate 



Encoding 



31 



~ i — r~ 





24 23 

"~i — i — i — i — r— 

10 1 1 1 



— i — i — i — r 

110 11 



16 15 87 

t — i — i — i — i — i — i — i — i — i — i — i — i — i — r 



Description 



The RPTS instruction allows a single instruction to be repeated a number of 
times without any penalty for looping. Fetches can also be made from the in- 
struction register (IR), thus avoiding repeated memory access. 

The src operand is loaded into the repeat counter (RC). A 1 is written into the 
repeat mode bit of the status register ST (RM). A 1 is also written into the repeat 
single bit (S). This indicates that the program fetches are to be performed only 
from the instruction register. The next PC is loaded into the repeat end address 
(RE) register and the repeat start address (RS) register. 

For the immediate mode, the srcoperand is assumed to be an unsigned integer 
and is not sign-extended. 



Cycles 


4 




Status Bits 


LUF 


Unaffected. 




LV 


Unaffected. 




UF 


Unaffected. 




N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 


Mode Bit 


OVMI 


Doeration is n 
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RPTS Repeat Single 

Example rpts AR5 

Before Instruction: 



PC = 123h 

ST = 0h 

RS = Oh 

RE = 0h 

RC = 0h 

AR5 = OFFh 

LUFLV UF N Z V C = 00 

After Instruction: 

PC = 124h 

ST=100h 

RS = 124h 

RE = 124h 

RC = OFFh 

AR5 = OFFh 

LUFLVUFNZVC = 0000 



1 0-1 62 Assembly Language Instructions 



Signal, Interlocked SIGI 



Syntax 


SIGI 


Operation 


Signal interlocked operation. 
Wait for interlock acknowledge 
Clear interlock. 


Operands 


None 


Encoding 





31 



— i — r - 





24 23 

-1 — i — i — i — r - 

10 110 







16 15 87 

— i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — i — r~ 

00000000 0000000000000 



Description 



An interlocked operation is signaled over XF0 and XF1 . After the interlocked 
operation is acknowledged, the interlocked operation ends. SIGI ignores the 
external ready signals. Refer to Section 6.4 on page 6-1 for detailed informa- 
tion. 



Cycles 


1 




Status Bits 


LUF 


Unaffected. 




LV 


Unaffected. 




UF 


Unaffected. 




N 


Unaffected. 




Z 


Unaffected. 




V 


Unaffected. 




c 


Unaffected. 


Mode Bit 


OVM( 


Dperation is n 



10-163 



STF Store Floating-Point 

Syntax 

Operation 

Operands 



Encoding 



31 



t — r~ 





Description 

Cycles 
Status Bits 



Mode Bit 
Example 



STF src, dst 

src -> dst 

src register (Rn, < n < 7) 

dst general addressing modes (G): 

1 direct 

1 indirect 



24 23 

~~i — i — i — i — r - ■ 

10 10 



t — i — i — r 

src 



16 15 87 ___■ 

t — i — j — i — i — i — i — i — i — i — i — i — i — i — r 



dst 



The src register is loaded into the dst memory location. The src and dst oper- 
ands are assumed to be floating-point numbers. 

1 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



OVM Operation is not affected by OVM bit value. 

STF R2,@98Alh 

Before Instruction: 

DP = 80h 

R2 = 052C501900h = 4.30782204e + 01 

Data at 8098A1 h = Oh 

LUFLV UF N Z V C^0 00 

After Instruction: 

DP = 80h 

R2 = 052C501900h = 4.30782204e + 01 

Data at 8098A1 h = 52C501 9h = 4.30782204e + 01 

LUFLV UF N Z V C = 00 
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Assembly Language Instructions 



Store Floating-Point, Interlocked STFI 



Syntax 


STFI src, dst 


Operation 


src -» dst 

Signal end of interlocked operation 


Operands 


src register (Rn, < n < 7) 




dst general addressing modes (G): 
1 direct 
1 indirect 



Encoding 



31 



i l 





24 23 

-1 — I — I — I — I - 

10 10 1 



t — i — i — r 

src 



16 15 87 

t — i — i — i — i — i — i— i — i — i — i — i — i — i — r 



dst 



Description 



Cycles 
Status Bits 



Mode Bit 
Example 



The src register is loaded into the dst memory location. An interlocked opera- 
tion is signaled over pins XF0 and XF1 . The srcand dst operands are assumed 
to be floating-point numbers. Refer to Section 6.4 on page 6-1 for detailed in- 
formation. 

1 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



OVM Operation is not affected by OVM bit value. 

STFI R3,*-AR4 

Before Instruction: 

R3 = 0733C00000h = 1 .79750e + 02 

AR4 = 80993Ch 

Data at 80993Bh = Oh 

LUF LV UF N Z V C = 

After Instruction: 

R3 = 0733C00000h = 1 .79750e + 02 

AR4 = 80993Ch 

Data at 80993Bh = 733C000h = 1 .79750e + 02 

LUFLV UF N Z V C = 00 
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STF| |STF Parallel Store Floating-Point 



Syntax 



Operation 



Operands 



Encoding 



31 



1 1 



STF src2, dst2 
|| STF srd,dst1 

src2 -» dst2 
|| srd -^ dstl 

srd register (Rn1 , < n1 < 7) 

dstl indirect (disp = 0, 1, IRO, IR1) 

src2 register (Rn2, < n2 < 7) 

dst2 indirect (disp = 0, 1, IRO, IR1) 



— i — i — i — i — 





24 23 

— i — r 

src2 



-I — I — 





16 15 



i — r 
srd 



t — i — i — i — i — r 

dstl 



"i — i — i — i — r 

dst2 



i — r 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



Two STF instructions are executed in parallel. Both srd and src2 are assumed 
to be floating-point numbers. 

1 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



OVM Operation is not affected by OVM bit value. 

STF R4,*AR3-- 
| | STF R3, *++AR5 

Before Instruction: 

R4 = 070C800000h = 1 .4050e + 02 

AR3 = 809835h 

R3 = 0733C00000h = 1 .79750e + 02 

AR5 = 8099D2h 

Data at 809835h = Oh 

Data at 8099D3h = Oh 

LUF LV UF N Z V C = 

After Instruction: 

R4 = 070C800000h = 1 .4050e + 02 

AR3 = 809834h 

R3 = 0733C00000h = 1 .79750e + 02 

AR5 = 8099D3h 

Data at 809835h = 070C8000h = 1 .4050e + 02 

Data at 8099D3h = 0733C000h = 1 .79750e + 02 

LUFLV UF N Z V C = 00 
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Assembly Language Instructions 



Syntax 

Operation 

Operands 



Encoding 



31 



- 1 — r~ 





Description 

Cycles 
Status Bits 



Mode Bit 
Example 



Store Integer STI 



STI src, dst 

src -» dst 

src register (Rn, < n < 27) 

dst general addressing modes (G): 
1 direct 
1 indirect 



24 23 



- 1 — i — i — i — r 

10 10 1 



t r 

src 



16 15 



t i i i i i r 



87 



^ r 

dst 



i — i — i — i — i — r 



The src register is loaded into the dst memory location. The src and dst oper- 
ands are assumed to be signed integers. 

1 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



OVM Operation is not affected by OVM bit value. 
STI R4,@982Bh 
Before Instruction: 



DP = 80h 

R4 = 42BD7h = 273.367 

Data at 80982Bh = 0E5FCh = 58,876 

LUFLV UF N Z V C = 







After Instruction: 



DP = 80h 

R4 = 42BD7h = 273,367 

Data at 80982Bh = 42BD7h = 273,367 

LUF LV UF N Z V C = 
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STII 



Store Integer, Interlocked 



Syntax 
Operation 

Operands 



Encoding 



31 



l l 





Description 

Cycles 
Status Bits 



Mode Bit 
Example 



STII src, dst 

src -» dst 

Signal end of interlocked operation. 

src register (Rn, < n < 27) 

dst general addressing modes (G): 
1 direct 
1 indirect 



24 23 



— T — I 1 1 1 - 

10 1 1 1 



t — i — r 
src 



16 15 



t — i — i — i — i — i — r 



87 



t — r 

dst 



t — i — i — i — i — r 



The src register is loaded into the dst memory location. An interlocked opera- 
tion is signaled over pins XF0 and XF1 . The src and dst operands are assumed 
to be signed integers. Refer to Section 6.4 for detailed information. 

1 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



OVM Operation is not affected by OVM bit value. 

STII Rl,@98AEh 

Before Instruction: 

DP = 80h 

R1 = 78Dh 

Data at 8098AEh = 25Ch 

After Instruction: 

DP = 80h 

R1 = 78Dh 

Data at 8098AEh = 78Dh 
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Assembly Language Instructions 



Parallel STI and STI STI 1 1 STI 



Syntax 



Operation 



Operands 



Encoding 



31 



T — 
1 1 



STI src2, dst2 
|| STI srd, dstl 

src2 -> dst2 
|| srd — » dstl 

srd register (Rn1 , < n1 < 7) 

dstl indirect (disp = 0, 1, IRO, IR1) 

src2 register (Rn2, < n2 < 7) 

dst2 indirect (disp = 0, 1 , IRO, IR1) 



-i — i — i — r— 

1 



24 23 



t — r 

src2 



1615 



I I 





srd 



t — i — i — r 

dstl 



87 



t — i — i — i — r 

dst2 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



Two integer stores are performed in parallel. If both stores are executed to the 
same address, the value written is that of STI src2, dst2. 

1 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



OVM Operation is not affected by OVM bit value. 

STI R0, *++AR2 (IRO) 
| | STI R5,*AR0 

Before Instruction: 

R0 = ODCh = 220 

AR2 = 809830h 

IR0 = 8h 

R5 = 35h = 53 

AR0 = 8098D3h 

Data at 809838h = Oh 

Data at 8098D3h = Oh 

LUFLV UF N Z V C = 00 
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STI 1 1 STI Parallel STI and STI 



After Instruction: 

R0 = 0DCh = 220 

AR2 = 809838h 

IR0 = 8h 

R5 = 35h = 53 

ARO = 8098D3h 

Data at 809838h = ODCh = 220 

Data at 8098D3h = 35h = 53 

LUFLV UF N Z V C = 00 



1 0- 1 70 Assembly Language Instructions 



Syntax 


SUBB src, dst 


Operation 


dst-src-C -> dst 


Operands 


src general addressing modes (G): 




register (Rn, < n < 27) 




1 direct 




1 indirect 




1 1 immediate 



Subtract Integer With Borrow SUBB 



Encoding 



31 



T T - 





dst register (Rn, < n < 27) 



2423 

i — i — i — i — r 



10 1 10 1 



t — r 



16 15 



dst 



t — i — r 



t — r 



87 

-i — r 



src 



t — i — i — r 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The difference of the dst, src, and C operands is loaded into the dst register. 
The dst and src operands are assumed to be signed integers. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 

OVM Operation is affected by OVM bit value. 

SUBB *AR5++(4) , R5 

Before Instruction: 

AR5 = 809800h 

R5 = OFAh = 250 

Data at 809800h = 0C7h = 199 

LUF LV UF N Z V C = 1 

After Instruction: 

AR5 = 809804h 

R5 = 032h = 50 

Data at 809800h = 0C7h = 1 99 

LUFLV UF N Z V C = 00 



10-171 



SU B B3 Subtract Integer With Borrow, 3-Operand 



•vwssssiwssswssftws-'SS: 



S\-WSSSSSftWSSriW!Wfti 



Syntax 

Operation 

Operands 



Encoding 



31 



t — r~ 

1 



SUBB3 src2, srd, dst 

srd - src2 - C -> dst 

srd three-operand addressing modes (T): 
register (Rn1 , < M < 27) 

1 indirect (disp = 0, 1, IRO, IR1) 

1 register (Rn1 , <> n1 < 27) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
register(Rn2, < n2 < 27) 

1 register (Rn2, < n2 < 27) 

1 indirect (disp = 0, 1, IRO, IR1) 
1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, < n < 27) 



24 23 

- 1 — I — I — I — r - 

1 10 



i — i — i — r 

dst 



1615 



t — i — i — i — i — r 

srd 



87 



t — i — i — i — i — r 

src2 



Description 

Cycles 
Status Bits 



Mode Bit 



The difference of the srd and src2 operands and the C (carry) flag is loaded 
into the dst register. The srd, src2, and dst operands are assumed to be 
signed integers. 

1 

These condition flags are modified only if the destination register is R7 — RO. 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 

OVM Operation is affected by OVM bit value. 
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Assembly Language Instructions 



Subtract Integer With Borrow, 3-Operand SUB B3 



Example SUBB3 R5, *AR5++ (IRO) , RO 

Before Instruction: 



AR5 = 809800h 

IRO = 4h 

R5 = 0C7h = 199 

R0 = 0h 

Data at 809800h = OFAh = 250 

LUF LV UF N Z V C = 1 

After Instruction: 

AR5 = 809804h 

IRO = 4h 

R5 = 0C7h = 199 

RO = 32h = 50 

Data at 809800h = OFAh = 250 

LUFLV UF N Z V C = 



10-173 



S U B C Subtract In teger Conditionally 



Syntax 


SUBC src, dst 


Operation 


U(dst-src>0): 




(dst- src « 1 ) OR 1 -» dst 




Else: 




dst « 1 -> dst 


Operands 


src general addressing modes (G): 




register (Rn, < n < 27) 




1 direct 




1 indirect 




1 1 immediate 



Encoding 



31 



- 1 — r~ 




dst register (Rn, < n < 27) 



24 23 

-i — i — i — i — i — 

10 1 110 



t — i — r 

dst 



16 15 87 

t — i — i — i — i — i — i — i — r 



t — i — i — i — i — r 



Description 



Cycles 
Status Bits 



The srcoperand is subtracted from the dst operand. The dst operand is loaded 
with a value dependent upon the result of the subtraction. If (dst- src) is greater 
than or equal to zero, then (dst- src) is left-shifted one bit, the least significant 
bit is set to 1 , and the result is loaded into the dst register. If (dst -src) is less 
than zero, dstis left-shifted one bit and loaded into the c/sf register. The dst and 
src operands are assumed to be unsigned integers. 

SUBC may be used to perform a single step of a multibit integer division. See 
subsection 11 .3.4 for a detailed description. 

1 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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Assembly Language Instructions 



Subtract Integer Conditionally SUBC 



Example 1 



SUBC @98C5h,Rl 

Before Instruction: 

DP = 80h 

R1 =04F6h = 1270 

Data at 8098C5h = 492h = 1 1 70 

LUFLV UF N Z V C = 00 

After Instruction: 

DP = 80h 

R1 = 0C9h = 201 

Data at 8098C5h = 492h = 1170 

LUFLV UF N Z V C = 00 



Example 2 



SUBC 3000, R0 (3000 
Before Instruction: 



0BB8h) 



R0 = 07D0h = 2000 

LUF LV UF N Z V C = 

After Instruction: 







R0 = OFAOh = 4000 

LUFLV UF N Z V C = 00 
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SUBF Subtract Floating-Point 



Syntax 


SUBF src, dst 


Operation 


dst- src ^ dst 


Operands 


src general addressing modes (G) 




register (Rn, < n < 7) 




1 direct 




1 indirect 




1 1 immediate 



Encoding 



31 



T [— 





dst register (Rn, < n < 7) 



24 23 



—\ — I — I — I — I - 

10 1 111 



n — r 

dst 



16 15_ 87 

i — i — i — i — i — i — i — i — i — i — i — i — r 



t — r 



src 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The difference of the dst operand minus the src operand is loaded into the 
dst register. The dst and src operands are assumed to be floating-point num- 
bers. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

LV 1 if an floating-point overflow occurs, unchanged otherwise. 

UF 1 if a floating-point underflow occurs, otherwise. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an floating-point overflow occurs, otherwise. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 

SUBF *AR0--(IR0) , R5 

Before Instruction: 

ARO = 809888h 

IR0 = 80h 

R5 = 0733C00000h = 1 .79750000e + 02 

Data at 809888h = 70C8000h = 1 .4050e + 02 

LUFLV UF N Z V C = 00 

After Instruction: 

ARO = 809808h 

IRO = 80h 

R5 = 051 DOOOOOOh = 3.9250e + 01 

Data at 809888h = 70C8000h = 1 .4050e + 02 

LUFLV UF N Z V C = 00 
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Assembly Language Instructions 



Subtract Floating-Point, 3-Operand SUBF3 



Syntax 

Operation 

Operands 



Encoding 



31 



t — r~ 

1 



SUBF3 src2, srd, dst 

srd - src2 -> dst 

srd three-operand addressing modes (T): 
register (Rn1, < n1 <7) 

1 indirect (disp = 0, 1, IRO, IR1) 

1 register (Rn1, <n1 <7) 

1 1 indirect (disp = 0, 1 , IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, < n2 < 7) 

1 register (Rn2, < n2 < 7) 

10 indirect (disp = 0, 1 , IRO, IR1) 

1 1 indirect (disp = 0, 1 , IRO, IR1) 

dst register (Rn, < n < 7) 



24 23 

— r 



- 1 — i — i — r 

1 10 1 



16 15 



t i r 

dst 



t i i I i i r 

srd 



t i I I i i r 

src2 



Description 

Cycles 
Status Bits 



The difference of the srd and src2 operands is loaded into the dst register. 
The srd, src2, and dst operands are assumed to be floating-point numbers. 



Mode Bit 



1 

These condition flags are modified only if the destination register is R7 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

LV 1 if an floating-point overflow occurs, unchanged otherwise. 

UF 1 if a floating-point underflow occurs, otherwise. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an floating-point overflow occurs, otherwise. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 



R0. 



10-177 



SUBF3 Subtract Floating-Point, 3-Operand 



Example 1 



Example 2 



SUBF3 *ARO--(IRO) ,*AR1,R4 

Before Instruction: 

ARO = 809888h 

IRO = 80h 

AR1 = 809851 h 

R4 = 0h 

Data at 809888h = 70C8000h = 1 .4050e + 02 

Data at 809851 h = 733C000h = 1 .79750e + 02 

LUFLV UF N Z V C = 00 

After Instruction: 

ARO = 809808h 

IR0 = 80h 

AR1 = 809851 h 

R4 = 51 DOOOOOOh = 3.9250e + 01 

Data at 809888h = 70C8000h = 1 .4050e + 02 

Data at 809851 h = 733C000h = 1 .79750e + 02 

LUFLV UF N Z V C = 00 

SUBF3 R7,R0,R6 

Before Instruction: 

R7 = 57B400000h = 6.281 250e + 01 
RO = 34C200000h = 1. 275781 25e + 01 
R6 = 0h 

LUFLV UF N Z V C = 00 

After Instruction: 

R7 = 57B400000h = 6.281 250e + 01 

RO = 34C200000h = 1 .275781 25e + 01 

R6 = 5B7C80000h= - 5.00546875e + 01 

LUF LV UF N Z V C = 10 
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Assembly Language Instructions 



Parallel SUBF3 and STF SUBF3I ISTF 



Syntax 

Operation 

Operands 



Encoding 



31 



SUBF3 srrt , src2, dstl 
STF src3, dst2 

src2- srd -> dstl 
src3 -» dst2 



srd 
src2 
dstl 
src3 
dst2 



register (Rn1, 0<n1 <7) 
indirect (disp = 0, 1, IRO, IR1) 
register (Rn2, < n2 < 7) 
register (Rn3, < n3 < 7) 
indirect (disp = 0, 1, IRO, IR1) 



T — 
1 1 



24 23 



16 15 



87 



- 1 — I — I - 

10 1 



i r 

dstl 



i — r 

srd 



t — r 
src3 



t — i — i — i — r 

dst2 



t — r 



t — i — i — i — i — r 

src2 



Description 



Cycles 
Status Bits 



A floating-point subtraction and a floating-point store are performed in parallel. 
All registers are read at the beginning and loaded at the end of the execute 
cycle. This means that if one of the parallel operations (STF) reads from a reg- 
ister and the operation being performed in parallel (SUBF3) writes to the same 
register, then STF accepts as input the contents of the register before it is modi- 
fied by the SUBF3. 

If src3 and dstl point to the same location, src3 is read before the write to dst 1 . 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF 

LV 

UF 

N 

Z 

V 

c 



1 if a floating-point underflow occurs, unchanged otherwise. 

1 if an floating-point overflow occurs, unchanged otherwise. 

1 if a floating-point underflow occurs, otherwise. 

1 if a negative result is generated, otherwise. 

1 if a zero result is generated, otherwise. 

1 if an floating-point overflow occurs, otherwise. 



Mode Bit 



Unaffected. 
OVM Operation is not affected by OVM bit value. 
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SUBF3NSTF Parallel SUBF3 and STF 



Example SUBF3 Rl , *-AR4 ( IR1 ) , RO 

| | STF R7,*+AR5 (IRO) 

Before Instruction: 



R1 = 


057B400000h 


= 6.281 25e 


+ 01 






AR4 


= 8098B8h 










IR1 = 


= 8h 










R0 = 


Oh 










R7 = 


0733C00000h 


= 1 .79750e 


+ 02 






AR5 


= 809850h 










IR0 = 


= 10h 










Data at 8098B0h = 


70C8000h = 


: 1 .4050e 


+ 02 


Data at 809860h = ( 


Dh 








LUF 


LV UF N Z 


V C = 











After Instruction: 















R1 = 057B400000h = 6.281 25e + 01 

AR4 = 8098B8h 

IR1 =8h 

RO = 061 B600000h = 7.768750e + 01 

R7 = 0733C00000h = 1 .79750e + 02 

AR5 = 809850h 

IRO = 10h 

Data at 8098B0h = 70C8000h = 1 .4050e + 02 

Data at 809860h = 733C000h = 1 .79750e + 02 

LUF LV UF N Z V C = 
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Subtract Integer SUBI 



Syntax 


SUBI src, dst 


Operation 


dst- src -^ dst 


Operands 


src general addressing modes (G): 




register (Rn, < n < 27) 




1 direct 




1 indirect 




1 1 immediate 



Encoding 



31 



I I 





dst register (Rn, < n < 27) 



24 23 



—i — i — i — i — r~ 

110 



t — i — r 
dst 



16 15 87 . 

t — i — i — i — i — i — i — i — i — i — i — i — i — i — r 



src 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The difference of the dst operand minus the src operand is loaded into the dst 
register. The dst and src operands are assumed to be signed integers. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 

OVM Operation is affected by OVM bit value. 

SUBI 220, R7 

Before Instruction: 



R7 = 226h = 550 






LUF LV UF N Z V C = 








After Instruction: 







R7 = 14Ah = 330 

LUF LV UF N Z V C = 
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SUB13 Subtract Integer, 3-Operand 



Syntax 

Operation 

Operands 



Encoding 



31 



—\ — r~ 
o o 1 



SUBI3 src2, srd, dst 

srd - src2 -> dst 

srd three-operand addressing modes (T): 
register (Rn1 , < n1 < 27) 

1 indirect (disp = 0, 1, IRO, IR1) 

1 register (Rn1 , < n1 < 27) 
11 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, < n2 < 27) 
1 register (Rn2, < n2 < 27) 

10 indirect (disp = 0, 1, IRO, IR1) 

11 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, < n < 27) 



24 23 



I I I I I 

1 110 



t — i — r 
dst 



16 15 



i — i — r 



n r 

srd 



t — r 



i i i r 

src2 



i — r 



Description 

Cycles 
Status Bits 



The difference of the srd operand minus the src2 operand is loaded into the 
dst register. The srd, src2, and dst operands are assumed to be signed inte- 
gers. 



Mode Bit 



1 

These condition flags are modified only if the destination register is R7 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 

OVM Operation is affected by OVM bit value. 



R0. 
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Assembly Language Instructions 



Subtract Integer, 3 : Operand SUB\3 



Example 1 



Example 2 



SUBI3 R7,R2,R0 

Before Instruction: 

R2 = 0866h = 2150 

R7 = 0834h = 2100 

R0 = 0h 

LUF LV UF N Z V C = 

After Instruction: 

R2 = 0866h = 2150 
R7 = 0834h = 2100 
RO = 032h = 50 

LUFLV UF N Z V C = 

SUBI3 *-AR2 (1) ,R4,R3 
Before Instruction: 







10 



AR2 = 80985Eh 

R4 = 0226h = 550 

R3 = 0h 

Data at 80985Dh = ODCh = 220 

LUF LV UF N Z V C = 

After Instruction: 







AR2 = 80985Eh 

R4 = 0226h = 550 

R3 = 014Ah = 330 

Data at 80985Dh = ODCh = 220 

LUF LV UF N Z V C = 
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SUBI3JJSTI Parallel SUBI3 and STI 



Syntax 


SUBI3 srd, src2, dstl 




|| STI src3,dst2 


Operation 


src2-srd -> dstl 




|| src3^dst2 


Operands 


srd register (Rn1 , < n1 < 7) 




src2 indirect (disp = 0, 1, IRO, IR1) 




dstl register (Rn2, < n2 < 7) 




src3 register (Rn3, < n3 < 7) 




dst2 indirect (disp = 0, 1, IRO, IR1) 



Encoding 



31 



24 23 



1 1 



^ IT I 

10 110 



t — r 
dstl 



16 15 



t — r 
srd 



t — r 
src3 



t — r 



i — i — i — i — r 

dst2 



87 



t — i — i — i — i — r 

src2 



Description 



Cycles 
Status Bits 



An integer subtraction and an integer store are performed in parallel. All regis- 
ters are read at the beginning and loaded at the end of the execute cycle. This 
means that if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (SUBI3) writes to the same register, then 
STI accepts as input the contents of the register before it is modified by the 
SUBI3. 

If src3 and dstl point to the same location, src3 is read before the write to dst 1. 

1 



Mode Bit 



These condition flags are modified only if the destination register is R7 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 

OVM Operation is affected by OVM bit value. 



R0. 
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Assembly Language Instructions 



Parallel SUBI3 and STI SUBI3I ISTI 



Example 



SUBI3 R7,* 


+AR2 (IRO) ,R1 


| | STI R3,* 


+ +AR7 




Before Instructor 


i: 




R7 = 14h = 20 




AR2 = 80982Fh 






IR0 = 10h 






R1 =0h 






R3 = 35h = 53 






AR7 = 80983Bh 






Data at 80983Fh = 


0DCh = 


= 220 


Data at 80983Ch = 


Oh 




LUF LV UF N Z 


V C = 


= 


After Instruction: 











R7 = 14h = 20 

AR2 = 80982Fh 

IRO = 10h 

R1 = 0C8h = 200 

R3 = 35h = 53 

AR7 = 80983Ch 

Data at 80983Fh = ODCh = 220 

Data at 80983Ch = 35h = 53 

LUFLV UF N Z V C = 00 



10-185 



SUBRB Subtract Reverse Integer With Borrow 



Syntax 

Operation 

Operands 



Encoding 



31 



I I 





SUBRB src, dst 
src - dst- C^ dst 

src general addressing modes (G): 

register (Rn, < n < 27) 

1 direct 

1 indirect 
1 1 immediate 

dst register (Rn, < n < 27) 



2423 

t — i — i — i — r 



110 1 



1615 



dst 



t — r 



87 

1 — r 



src 



t — i — i — i — i — r 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The difference of the src, dst, and C operands is loaded into the dst register. 
The dst and src operands are assumed to be signed integers. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 

OVM Operation is affected by OVM bit value. 

SUBRB R4,R6 

Before Instruction: 



R4 = 03CBh = 971 
R6 = 0258h = 600 
LUF LV UF N Z V C = 1 

After Instruction: 

R4 = 03CBh = 971 
R6 = 0172h = 370 
LUFLV UF N Z V C = 00 
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Assembly Language Instructions 



Subtract Reverse Floating-Point SUBRF 



Syntax 


SUBRF src, dst 


Operation 


src - dst -> dst 


Operands 


src general addressing modes (G) 




register (Rn, < n < 7) 




1 direct 




1 indirect 




1 1 immediate 



Encoding 



31 



l l 





dst register (Rn, < n < 7) 



24 23 



16 15 



87 



m i i i 

110 10 



i i r 

dst 



i — i — i — i — i — i — i — i — r 

src 



t — r 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The difference of the src operand minus the dsfoperand is loaded into the dst 
register.The dst and src operands are assumed to be floating-point numbers. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF 1 if a floating-point underflow occurs, unchanged otherwise. 

LV 1 if a floating-point overflow occurs, unchanged otherwise. 

UF 1 if a floating-point underflow occurs, otherwise. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if a floating-point overflow occurs, otherwise. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 

SUBRF @9905h,R5 

Before Instruction: 

DP = 80h 

R5 = 057B400000h = 6.281 250e + 01 

Data at 809905h = 733C000h = 1 .79750e + 02 

LUFLV UF N Z V C = 00 

After Instruction: 

DP = 80h 

R5 = 0669E00000h = 1 .1 6937500e + 02 

Data at 809905h = 733C000h = 1 .79750e + 02 

LUFLV UF N Z V C = 00 
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SUBRI Subtract Reverse Integer 



Syntax 

Operation 

Operands 



Encoding 



31 



T 1— 





SUBRI src,dst 
src - dst -> dst 

src general addressing modes (G): 

register (Rn, < n < 27) 

1 direct 

1 indirect 
1 1 immediate 

dst register (Rn, < n < 27) 



24 23 

—i — i — i — i — r - 

110 11 



t — i — i — r 

dst 



16 15 87 

t — i — i — i — i — i — i — i — i — i — i — i — i — r-T 



src 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The difference of the src operand minus the dst operand is loaded into the dst 
register. The dst and src operands are assumed to be signed integers. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV 1 if an integer overflow occurs, unchanged otherwise. 

UF 0. 

N 1 if a negative result is generated, otherwise. 

Z 1 if a zero result is generated, otherwise. 

V 1 if an integer overflow occurs, otherwise. 

C 1 if a borrow occurs, otherwise. 

OVM Operation is affected by OVM bit value. 

SUBRI *AR5++(IR0) , R3 

Before Instruction: 



AR5 = 809900h 


















IR0 = 8h 


















R3 = ODCh = 220 
















1 


Data at 809900h = 


226h = 


550 














LUF LV UF N Z 


V C 


= 




















After Instruction: 



















AR5 = 809908h 

IR0 = 8h 

R3 = 014Ah = 330 

Data at 809900h = 226h = 550 

LUFLVUFNZVC = 00 00 
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Assembly Language Instructions 



Software Interrupt SWI 



Syntax 
Operation 
Operands 
Encoding 



SWI 

Performs an emulation interrupt 
None 



31 






24 23 


16 15 


87 


I 

1 


I I I 

10 1 


1 


I I I 




1 I I I 1 1 I 




1 1 1 1 1 1 i i i i i i i 

00000000000000 



Description 

Cycles 
Status Bits 



The SWI instruction performs an emulator interrupt. This is a reserved instruc- 
tion and should not be used in normal programming. 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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TRAPcond 


Trap Conditionally 


Syntax 


TRAPcond N 


Operation 


-» ST(GIE) 
If cond is true: 




Next PC -> *++SP, 




Trap vector N -> PC. 




Else: 




Set ST(GIE) to original state. 
Continue. 


Operands 


N(0<N<31) 


Encoding 





31 



-I — I — I — I — I — I — 

1110 10 



24 23 



i i — r~ 





t — i — r 

cond 



16 15 



87 
~i — r 



t — i — i — i — i — i — i — i — i — i — 

00000000 000 



i — i — i — r 

N 



Description 



Cycles 
Status Bits 



Interrupts are disabled globally when is written to ST(GIE). If the condition 
is true, the contents of the PC are pushed onto the system stack, and the PC 
is loaded with the contents of the specified trap vector (N). If the condition is 
not true, ST(GIE) is set to its value before the TRAPcond instruction changes 
it. 

The TMS320C3x provides 20 condition codes that can be used with this in- 
struction (see Section 1 0.2 on page 1 0-9 for a list of condition mnemonics, 
encoding, and flags). Condition flags are set on a previous instruction only 
when the destination register is one of the extended-precision registers 
(R7-R0) or when one of the compare instructions (CMPF, CMPF3, CMPI, 
CMPI3, TSTB, or TSTB3) is executed. 



LUF 


Unaffected. 


LV 


Unaffected. 


UF 


Unaffected. 


N 


Unaffected. 


Z 


Unaffected. 


V 


Unaffected. 


c 


Unaffected. 



Mode Bit 



OVM Operation is not affected by OVM bit value. 
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Trap Conditionally TRAPcond 



Example TRAPZ 16 

Before Instruction: 



PC = 123h 

SP = 809870h 

ST = 0h 

Trap Vector 1 6 = 1 0h 

LUFLV UF N Z V C = 00 

After Instruction: 

PC = 10h 

SP = 809871 h 

Data at 809871 h = 124h 

ST = 0h 

LUFLV UF N Z V C = 00 
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TSTB Jest Bit Fields 




Syntax 


TSTB src, dst 


Operation 


dst AND src 


Operands 


src general addressing modes (G): 

register (Rn, < n < 27) 

1 direct 

1 indirect 
1 1 immediate 



Encoding 



31 



- 1 — I - 





dst register (Rn, < n < 27) 



24 23 



16 15 



87 



— T 1 1 1 1 — 

110 10 



t — i — r 

dst 



i i i i r 



1 — i — i — r 

src 



t — i — i — i — i — r 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The bitwise logical-AND of the dst and src operands is formed, but the result 
is not loaded in any register. This allows for nondestructive compares. The dst 
and src operands are assumed to be unsigned integers. 

1 

These condition flags are modified for all destination registers (R27 — R0). 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 

TSTB *-AR4 (1) ,R5 

Before Instruction: 

AR4 = 8099C5h 

R5 = 898h = 2200 

Data at 8099C4h = 767h = 1 895 

LUFLV UF N Z V C = 

After Instruction: 

AR4 = 8099C5h 

R5 = 898h = 2200 

Data at 8099C4h = 767h = 1 895 

LUF LV UF N Z V C = 1 
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Test Bit Fields, 3-Operand TSTB3 



Syntax 

Operation 

Operation 



Encoding 



31 



T r~ 

1 



TSTB3 src2, srd 

srd AND src2 

srd three-operand addressing modes (T): 
register (Rn1 , < n1 < 27) 

1 indirect (disp = 0, 1, IRO, IR1) 

1 register (Rn1 , < n1 < 27) 

1 1 indirect (disp = 0, 1 , IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, < n2 < 27) 

1 register (Rn2, < n2 < 1 27) 
10 indirect (disp = 0, 1 , IRO, IR1) 

1 1 indirect (disp = 0, 1 , IRO, IR1) 



24 23 

i i r i i 

1 111 



I I I I 





16 15 8 7 

T I I I I I T 



srd 



T 1 — I 1 — T 

src2 



Description 



Cycles 
Status Bits 



Mode Bit 



The bitwise logical-AND between the srd and src2 operands is formed, but 
is not loaded into any register. This allows for nondestructive compares. The 
srd and src2 operands are assumed to be unsigned integers. Although this 
instruction has only two operands, it is designated as a three operand instruc- 
tion because operands are specified in the three-operand format. 



1 

These condition flags are modified for all destination registers (R27 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 



R0). 
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TSTB3 



Test Bit Fields, 3-Operands 



Example 1 



Example 2 



TSTB3 *AR5--(IR0) , *+ARO (1) 

Before Instruction: 



AR5 = 809885h 

IRO = 80h 

ARO = 80992Ch 

Data at 809885h = 898h = 2200 

Data at 80992Dh = 767h = 1 895 

LUF LV UF N Z V C = 







After Instruction: 



AR5 = 809805h 

IR0 = 80h 

ARO = 80992Ch 

Data at 809885h = 898h = 2200 

Data at 80992Dh = 767h = 1 895 

LUFLV UF N Z V C = 1 

TSTB3 R4,*AR6--(IR0) 

Before Instruction: 

R4 = 0FBC4h 

AR6 = 8099F8h 

IRO = 8h 

Dataat8099F8h = 1568h 

LUFLV UF N Z V C = 00 

After Instruction: 

R4 = 0FBC4h 

AR6 = 8099F0h 

IR0 = 8h 

Dataat8099F8h = 1568h 

LUFLV UF N Z V C = 



10-194 



Assembly Language Instructions 



Bitwise Exclusive-OR XOR 



Syntax 


XOR src, dst 


Operation 


dstXORsrc->dst 


Operands 


src general addressing modes (G): 




register (Rn, < n < 27) 




1 direct 




1 indirect 




1 1 immediate 



Encoding 



31 



I I 





dst register (Rn, < n < 27) 



24 23 

— I — I — I — I — [— 

110 10 1 



t — i — i — r 

dst 



16 15 87 

t — i — i — i — i — i — i — i — i — r 



t — r 



src 



Description 

Cycles 
Status Bits 



Mode Bit 
Example 



The bitwise exclusive-OR of the srcand dst operands is loaded into the dst reg- 
ister. The dst and src operands are assumed to be unsigned integers. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 

XOR R1,R2 

Before Instruction: 

R1 = 0FFA32h 
R2 = 0FF5C1h 
LUFLV UF N Z V C = 00 

After Instruction: 

R1 =0FF412h 
R2 = 000FF3h 
LUFLV UF N Z V C = 00 
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X0R3 Bitwise Exclusive-OR, 3-Operand 



Syntax 

Operation 

Operands 



Encoding 



31 



i — r~ 
o o 1 



X0R3 src2, srd, dst 

srd XOR src2 -> dst 

srd three-operand addressing modes (T): 
register (Rn1 , < n1 < 27) 

1 indirect (disp = 0, 1, IRO, IR1) 

1 register (Rn1 , < n1 < 27) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

src2 three-operand addressing modes (T): 
register (Rn2, < n2 < 27) 

1 register (Rn2, < n2 < 27) 
10 indirect (disp = 0, 1, IRO, IR1) 

1 1 indirect (disp = 0, 1, IRO, IR1) 

dst register (Rn, < n < 27) 



24 23 



—i — i — i — i — r 

10 



t — r 

dst 



1615 



t — i — i — i — i — i — r 

srd 



87 



t — r 



t — r 



src2 



Description 

Cycles 
Status Bits 



Mode Bit 



The bitwise exclusive-OR between the srd and src2 operands is loaded into 
the dst register. The srd, src2, and dst operands are assumed to be unsigned 
integers. 

1 

These condition flags are modified only if the destination register is R7 — RO. 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 
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Bitwise Exclusive-OR, 3-Operand X0R3 



Example 1 



Example 2 



X0R3 *AR3++(IR0) ,R7,R4 

Before Instruction: 

AR3 = 809800h 

IR0 = 10h 

R7 = OFFFFh 

R4 = 0h 

Data at 809800h = 5AC3h 

LUFLV UF N Z V C = 00 

After Instruction: 

AR3 = 80981 Oh 

IR0 = 10h 

R7 = OFFFFh 

R4 = 0A53'Ch 

Data at 809800h = 5AC3h 

LUFLV UF N Z V C = 00 

XOR3 R5,*-AR1 (1) , Rl 

Before Instruction: 



R5 = 0FFA32h 

AR1 = 809826h 

R1 =0h 

Data at 809825h = 0FF5C1 h 

LUF LV UF N Z V C = 







After Instruction: 

R5 = 0FFA32h 

AR1 = 809826h 

R1 = 000F33h 

Data at 809825h = 0FF5C1 h 

LUFLV UF N Z V C = 00 
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XOR3| |STI Parallel XOR3 and STI 



Syntax 



Operation 



Operands 



Encoding 



31 



1 1 



X0R3 src2, srd, dstl 
STI src3, dst2 

srd XOR src2 -» dstl 
src3 -> dst2 



srd 
src2 
dstl 
src3 
dst2 



register (Rn1, < n1 <7) 
indirect (disp = 0, 1, IRO, IR1) 
register (Rn2, < n2 < 7) 
register (Rn3, < n3 < 7) 
indirect (disp = 0, 1, IRO, 1R1) 



— i — i — i — r - 

10 111 



24 23 

— i — r 

dst 



i — r- 
srd 



16 15 



i r 

src3 



t — r 



— i — i — i — r 

dst2 



i — i — i — r 
src2 



t — r 



Description 



Cycles 
Status Bits 



Mode Bit 



A bitwise exclusive-XOR and an integer store are performed in parallel. All reg- 
isters are read at the beginning and loaded at the end of the execute cycle. This 
means that if one of the parallel operations (STI) reads from a register and the 
operation being performed in parallel (XOR3) writes to the same register, then 
STI accepts as input the contents of the register before it is modified by the 
XOR3. 

If src2 and dst2 point to the same location, src2 is read before the write to dst2. 

1 

These condition flags are modified only if the destination register is R7 — R0. 

LUF Unaffected. 

LV Unaffected. 

UF 0. 

N MSB of the output. 

Z 1 if a zero output is generated, otherwise. 

V 0. 

C Unaffected. 

OVM Operation is not affected by OVM bit value. 
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Parallel X0R3 and STI X0R3I ISTI 



Example X0R3 * AR1 ++ , R3 , R3 

I | STI R6,*-AR2 (IRO) 

Before Instruction: 



AR1 


= 80987Eh 




R3 = 


85h 




R6 = 


ODCh = 220 




AR2 


= 8098B4h 




IRO = 


= 8h 




Data at 80987Eh = 


85h 


Data at 8098ACh = 


Oh 


LUF 


LV UF N Z 


V C = 


After Instruction: 









AR1 = 80987Fh 

R3 = Oh 

R6 = ODCh = 220 

AR2 = 8098B4h 

IR0 = 8h 

Data at 80987Eh = 85h 

Data at 8098ACh = ODCh = 220 

LUFLV UF N Z V C = 00 
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ers, Memory, and Cache 



Data Fc md Floating-Point Operation 



Addressing 



Program Flow Control 



External Bus Operation 



Peripherals 



Pipeline Operation 
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Software Applications 



Hardware Applications 



TMS320G3X Signal Description and Electrical Characteristics 



Instruction Opcodes 



Development Support/Part Order Information 



Quality and Reliability 



Calculation of TMS320C30 Power Dissipation 



SMJ320C30 Digital Signal Processor Data Sheet 



Quick Reference Guide 



Software Applications 



Chapter 11 

Software Applications 



The TMS320C3x is a powerful digital signal processor with an architecture and 
instruction set designed to make easy system solutions to DSP problems. 
There are instructions specifically designed for efficient implementation of 
DSP algorithms as well as general-purpose instructions that make the device 
suitable for more general tasks, like any microprocessor. The floating-point 
and integer arithmetic supported by the device permits the designer to concen- 
trate on the algorithm with minimal concerns about scaling, dynamic range, 
and overflows. 

The purpose of this chapter is to explain how to use the instruction set, the ar- 
chitecture, and the interface of the TMS320C3x processor. It presents coding 
examples for frequently used applications and discusses more involved exam- 
ples and applications. This chapter defines the principles involved in the appli- 
cation and gives the corresponding assembly-language code for instructional 
purposes and for immediate use. Whenever the detailed explanation of the un- 
derlying theory is too extensive to be included in this manual, appropriate refer- 
ences are given for further information. 

Major topics discussed in this chapter are listed below. 

Q Processor Initialization (Section 11.1 on page 11-3) 

□ Program Control (Section 11.2 on page 11-7) 

Subroutine calls 
Software stack 
Interrupt handling 
Delayed branches 
Repeat modes 
Computed GOTO's 

-ogical and Arithmetic Operations (Section 11.3 on page 11-21) 

Bit manipulation 
Block moves 
Bit-reversed addressing 
Division 
Square root 
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■ Extended-precision arithmetic 

■ IEEE <==> C3x floating-point conversions 

Qi Application-Oriented Operations (Section 1 1 .4 on page 1 1 -48) 

■ Companding (A-law, p-law) 

■ FIR/I I R filters (fixed and adaptive) 

■ Matrix math 

■ FFT 

■ Lattice filters 

Qi Programming Tips (Section 11 .5 on page 1 1 -88) 

■ C-callable routines 

■ Code optimization check list 

For convenience, the code in this section is located on the Tl DSP Bulletin 
Board System (BBS) at 713-274-2323. 
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Processor Initialization 



11.1 Processor Initialization 



Before you execute a digital signal processing algorithm, it is necessary to ini- 
tialize the processor. Generally, initialization takes place any time the proces- 
sor is reset. 



When reset is activated by applying a low level to the RESET input for several 
cycles, the TMS320C3x terminates execution and puts the reset vector (i.e., 
the contents of memory location 0) in the program counter. The reset vector 
normally contains the address of the system initialization routine. The hard- 
ware reset also initializes various registers and status bits. 

After reset, initialize the processor further by executing instructions that set 
up operational modes, memory pointers, interrupts, and the remaining func- 
tions needed to meet system requirements. 

To configure the processor at reset, the following internal functions should be 
initialized: 

□i Memory-mapped registers 

Qi Interrupt structure 

Example 11-1 shows coding for initializing the TMS320C3x to the following 
machine state, in addition to the initialization performed during the hardware 
reset (for conditions after hardware reset, see Chapter 12): 

Q All interrupts are enabled. 

Q The overflow mode is disabled. 

□i The data memory page pointer is set to zero. 

ti The internal memory is filled with zeros. 

Note that all constants larger than 1 6 bits should be placed in memory and ac- 
cessed through direct or indirect addressing. 
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Example 11-1. TMS320C3x Processor Initialization 



* TITL 'PROCESSOR INITIALIZATION EXAMPLE' 
* 

.global RESET, INIT, BEGIN 

. global INTO , INT1 , INT2 , INT3 

.global ISR0,ISR1,ISR2,ISR3 

.global DINT, DMA 

.global TINT0,TINT1,XINT0,RINT0,XINT1,RINT1 

. global TIMEO , TIME1 , XMTO , RCVO , XMT1 , RCV1 

.global TRAP 0, TRAP 1, TRAP2, TRPO, TRP1, TRP2 

* PROCESSOR INITIALIZATION FOR THE TMS320C3x. 
* 

* RESET AND INTERRUPT VECTOR SPECIFICATION. THIS 

* ARRANGEMENT ASSUMES THAT DURING LINKING, THE FOLLOWING 

* TEXT SEGMENT WILL BE PLACED TO START AT MEMORY 

* LOCATION 0. 



Named section 

RS- load address INIT to PC 

INTO- loads address ISRO to PC 

INT1- loads address ISR1 to PC 

INT2- loads address ISR2 to PC 

INT3- loads address ISR3 to PC 

Serial port transmit interrupt processing 

Serial port receive interrupt processing 

Serial port 1 transmit interrupt processing 

Serial port 1 receive interrupt processing 

Timer interrupt processing 

Timer 1 interrupt processing 

DMA interrupt processing 

Reserved space 

Trap vector processing begins 

Trap 1 vector processing begins 

Trap 2 vector processing begins 

Leave space for the other 29 traps 

*IN THIS SECTION, CONSTANTS THAT CANNOT BE REPRESENTED 

*IN THE SHORT FORMAT ARE INITIALIZED. THE NUMBERS IN PARENTHESIS 

*AT THE END OF THE COMMENTS REPRESENT THE OFFSET OF A 

^PARTICULAR CONTROL REGISTER FROM 

*CTRL (808000H) 





.sect 


"init 


RESET 
* 


.word 


INIT 


INTO 


• word 


ISRO 


INT1 


.word 


ISR1 


INT2 


.word 


ISR2 


INT3 


.word 


ISR3 


XINTO 


.word 


XMTO 


RINTO 


.word 


RCVO 


XINT1 


.word 


XMT1 


RINT1 


.word 


RCV1 


TINTO . 


.word 


TIMEO 


TINT1 


.word 


TIME1 


DINT 


• word 


DMA 




. space 


20 


TRAPO 


.word 


TRPO 


TRAP1 


.word 


TRP1 


TRAP 2 


.word 


TRP2 




. space 


29 





. data 






MASK 


. word 


OFFFFFFFFH 




BLKO 


.word 


0809800H 


Beginn 


BLK1 


.word 


0809C00H 


Beginn 


STCK 


.word 


0809F00H 


• Beginn 


CTRL 


.word 


0808000H 


Pointe 


DMACTL 


.word 


0000000H 


• Initia 


TIMOCTL 


.word 


0000000H 


■ Initia 


TIM1CTL 


.word 


0000000H 


Initia 


SERGLOB0 


.word 


0000000H 


• Init o 


SERPRTXO 


.word 


0000000H 


• Init o 


SERPRTRO 


.word 


0000000H 


• Init o 



ling address of RAM block 
ling address of RAM block 1 
ling of stack 

ir for peripheral— bus memory map 
ilization for DMA control (0) 
ilization of timer control (32) 
ilization of timer 1 control (48) 
)f serial glbl control (64) 
>f serial :-:mt port control (66) 
>f serial rev port control (67) 
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Processor Initialization 



SERTIMO 


• word 


0000000H 


SERGLOB1 


.word 


0000000H 


SERPRTX1 


.word 


0000000H 


SERPRTR1 


.word 


0000000H 


SERTIM1 


.word 


0000000H 


PARINT 


.word 


0000000H 


IOINT 


.word 


0000000H 



Init of serial timer control (68) 
Init of serial 1 glbl control (80) 
Init of serial 1 >:mt port control (82) 
Init of serial 1 rev port control (83) 
Init of serial 1 timer control (84) 
Init parallel interface control (100) 
Init I/O interface control (96) 



.text 

* THE ADDRESS AT MEMORY LOCATION DIRECTS EXECUTION TO BEGIN HERE 

* FOR RESET PROCESSING THAT INITIALIZES THE PROCESSOR. WHEN RESET 

* IS APPLIED, THE FOLLOWING REGISTERS ARE INITIALIZED TO ZERO: 
* 

* ST -- CPU STATUS REGISTER 

* IE -- CPU/DMA INTERRUPT ENABLE FLAGS 

* IF -- CPU INTERRUPT FLAGS 

* IOF-- I/O FLAGS 
* 

* THE STATUS REGISTER HAS THE FOLLOWING ARRANGEMENT: 

* BITS: 31-14 13 12 11 10 9 8 7 6 5 4 3 2 10 

* FUNCTION: RESRV GIE CC CE CF RESRV RM OVM LUF LV UF N Z V C 



INIT 



LDP 


0,DP 


LDI 


1800H,ST 


LDI 


@MASK, IE 



Point the DP register to page 

Clear and enable cache, and disable OVM 

Unmask all interrupts 



INTERNAL DATA MEMORY INITIALIZATION TO FLOATING POINT ZERO 



LDI 


@BLK0,AR0 


LDI 


@BLK1,AR1 


LDF 


0.0, R0 


RPTS 


1023 


STF 


R0, *AR0++(1) 


STF 


R0,*AR1++(1) 



AR0 points to block 

AR1 points to block 1 

Zero register R0 

Repeat 1024 times . . . 

Zero out location in RAM block and 

zero out location in RAM block 1. 



* THE PROCESSOR IS INITIALIZED. THE REMAINING APPLICATION- 

* DEPENDENT PART OF THE SYSTEM (BOTH ON- AND OFF-CHIP SHOULD 

* NOW BE INITIALIZED. 

* FIRST, INITIALIZE THE CONTROL REGISTERS. IN THIS EXAMPLE, 

* EVERYTHING IS INITIALIZED TO ZERO SINCE THE ACTUAL INITIALIZATION IS 

* APPLICATION DEPENDENT.' 



LDI 



LDI 
STI 



@CTRL,AR0 

@DMACTL,R0 
R0, *+AR0 (0) 



LOAD in AR0 the pointer to control 
registers 

Init DMA control 
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LDI @TIMOCTL,RO 

STI R0,*+AR0(32) 

LDI @TIM1CTL,R0 

STI R0,*+AR0(48) 

LDI @SERGLOB0,R0 

STI R0,*+AR0(64) 

LDI @SERPRTXO,RO 

STI R0,*+AR0(66) 

LDI @SERPRTRO,RO 

STI R0,*+AR0(67) 

LDI @SERTIMO,RO 

STI R0,*+AR0(68) 

LDI @SERGLOB1,RO 

STI R0,*+AR0(80) 

LDI @SERPRTX1,R0 

STI R0,*+AR0(82) 

LDI @SERPRTR1,R0 

STI R0,*+AR0(83) 

LDI @SERTIM1,R0 

STI RO,*+ARO (84) 

LDI @PARINT,RO 

STI R0,*+AR0(100) 

LDI @IOINT,R0 

STI R0,*+AR0(96) 

LDI @STCK,SP 

OR 2000H,ST 

BR BEGIN 

. end 



Init timer control 

Init timer 1 control 

Init serial global control 

Init serial :-:mt control 

Init serial rev control 

Init serial timer control 

Init serial 1 global control 

Init serial 1 :-:mt control 

Init serial 1 rev control 

Init serial 1 timer control 

Init parallel interface control (C30 only) 

Init I/O interface control 

Initialize the stack pointer 
Global interrupt enable 

Branch to the beginning of application. 
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Program Control 



11.2 Program Control 



11.2.1 Subroutines 



One group of TMS320C3x instructions provides program control and facili- 
tates all types of high-speed processing. These instructions directly handle: 

Qi subroutine calls, 

Ql software stack, 

Q interrupts, 

Ql zero-overhead branches, and 

Q single- and multiple-instruction loops without any overhead. 



The TMS320C3x has a 24-bit program counter (PC) and a practically unlimited 
software stack. The CALL and CALLcond subroutine calls cause the stack 
pointer to increment and store the contents of the next value of the PC counter 
on the stack. At the end of the subroutine, RETScond performs a conditional 
return. 

Example 11-2 illustrates the use of a subroutine to determine the dot product 
between two vectors. Given two vectors of length N, represented by the arrays 

a [0], a [1] a [N -1] and b [0], b [1] b [N -1], the dot product is computed 

from the expression 

d = a [0] b [0] + a [1] b [1] + ... + a [N -1] b [N -1] 

Processing proceeds in the main routine to the point where the dot product is 
to be computed. It is assumed that the arguments of the subroutine have been 
appropriately initialized. At this point, a CALL is made to the subroutine, tran- 
sferring control to that section of the program memory for execution, then re- 
turning to the calling routine via the RETS instruction when execution has com- 
pleted. Note that for this particular example, it would suffice to save the register 
R2. However, a larger number of registers are saved for demonstration pur- 
poses. The saved registers are stored on the system stack. This stack should 
be large enough to accommodate the maximum anticipated storage require- 
ments. Other methods of saving registers could be used equally well. 
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Example 1 1-2. Subroutine Call (Dot Product) 



TITLE SUBROUTINE CALL (DOT PRODUCT) 



MAIN ROUTINE THAT CALLS THE SUBROUTINE % DOT' TO 'COMPUTE THE 
DOT PRODUCT OF TWO VECTORS. 



LDI @blkO,ARO 
LDI @blkl,ARl 
LDI N,RC 



ARO points to vector a 
AR1 points to vector b 
RC contains the number of elements 



CALL DOT 



SUBROUTINE DOT 



EQUATION: d = a(0) *b(0) +a(l) *b(l) + 



+ a(N-l) * b(N-l) 



THE DOT PRODUCT OF a AND b IS PLACED IN REGISTER RO . N MUST 
BE GREATER THAN OR EQUAL TO 2. 

ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 

+ 

ARO | ADDRESS OF a(0) 
AR1 | ADDRESS OF b(0) 
RC | LENGTH OF VECTORS (N) 

REGISTERS USED AS INPUT: ARO, ARl, RC 
REGISTER MODIFIED: RO 
REGISTER CONTAINING RESULT: RO 



DOT 



.globa 


1 DOT 


PUSH 


ST 


PUSH 


R2 


PUSHF 


R2 


PUSH 


ARO 


PUSH 


ARl 


PUSH 


RC 



Save status register 

Use the stack to save R2's' 

lower 32 and upper 32 bits 

Save ARO 

Save ARl 

Save RC 
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MPYF3. *AR0,*AR1,R0 
LDF 0.0, R2 

SUBI 2,RC 

* DOT PRODUCT (1 <= i < N) 



Initialize R0 : 
a(0) * b(0) -> RO 
Initialize R2 . 
Set RC = N-2 



RPTS RC / Setup the repeat single. 

MPYF3 *++AR0(l) , *++ARl (1) , RO ; a(i) * b(i) -> RO 
ADDF3 R0,R2,R2 ; a (i-1) *b (i-1) + R2 -> R2 



ADDF3 R0,R2,R0 
RETURN SEQUENCE 



POP 


RC 


POP 


AR1 


POP 


ARO 


POPF 


R2 


POP 


R2 


POP 


ST 


RETS 




end 




.end 





a(N-l) *b(N-l) + R2 -> RO 



Restore RC 

Restore AR1 

Restore ARO 

Restore top 32 bits of R2 

Restore bottom 32 bits of R2 

Restore ST 

Return 



11.2.2 Software Stack 



The TMS320C3x has a software stack whose location is determined by the 
contents of the stack pointer register SP. The stack pointer increments from 
low to high values, and provisions should be made to accommodate the antici- 
pated storage requirements. The stack can be used not only during the subrou- 
tine CALL and RETS, but also inside the subroutine as a place of temporary 
storage of the registers as shown in Example 11-2. SP always points to the 
last value pushed on the stack. 

The CALL and CALLcond instructions push the value of the program counter 
onto the stack, as do the interrupt routines. Then, RETScond and RETIcond 
pop the stack and place the value in the program counter. The integer value 
of any register can also be pushed onto and popped off the stack using the 
PUSH and POP instructions. There are two additional instructions, PUSHF 
and POPF, for floating point numbers. These instructions can be used to pop 
and push floating point numbers to registers R7 — RO. This feature is very 
useful for saving all 40 bits of the extended precision registers (see 
Example 11-2). Using PUSH and PUSHF on the same register saves the low- 
er 32 and upper 32 bits. PUSH saves the lower 32; PUSHF, the upper 32. To 
recover this extended precision number, a POPF can be done, followed by 
POP. It is important to do the integer and floating-point PUSH and POP in the 
above order. POPF forces the least significant eight bits of the extended-preci- 
sion registers to zero, and therefore must be performed first. 
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The stack pointer (SP) can be read as well as written to. Multiple stacks for 
different program segments may be easily created. SP is not initialized by the 
hardware during reset. It is therefore important to remember to initialize its val- 
ue so that SP points to a predetermined memory location. This avoids the prob- 
lem of SP attempting to write into ROM or over other useful data. 



11.2.3 Interrupt Service Routines 



Interrupts on the TMS320C3x are prioritized and vectored. When an interrupt 
occurs, the corresponding flag is set in the Interrupt Flag Register IF. If the cor- 
responding bit in the Interrupt Enable Register IE is set, and interrupts are en- 
abled by having the GIE bit in the status register set to 1 , interrupt processing 
begins. You can also write to the Interrupt flag register, allowing you to force 
an interrupt by software, or to clear interrupts without processing them. 

The Interrupt Flag Register IF can be read, and action can be taken, depending 
on whether the interrupt has occurred. This is true even when the interrupt is 
disabled. This can be useful when an interrupt-driven interface is not implem- 
ented. Example 1 1 -3 shows the case where a subroutine is called when inter- 
rupt 1 has not occurred. 



Example 1 1-3. Use of Interrupts for Software Polling 

* TITLE INTERRUPT POLLING 



TSTB 2, IF ; Test if interrupt 1 has occurred 

CALLZ SUBROUTINE ; If not, call subroutine 



When interrupt processing begins, the program counter is pushed on the stack, 
and the interrupt vector is loaded in the program counter. Interrupts are then 
disabled by setting the GIE=0, and the program continues from the address 
loaded in the program counter. Since all interrupts are disabled, interrupt pro- 
cessing may proceed without further interruption, unless the interrupt service 
routine re-enables interrupts. 

Except for very simple interrupt service routines, it is important to assure that 
the processor context is saved during execution of this routine. The context 
must be saved before you execute the routine itself, and restored after the rou- 
tine is finished. The procedure is called context switching. Context switching 
is also useful for subroutine calls, especially when extensive use is made of 
the auxiliary and the extended precision registers. Code examples of context 
switching and an interrupt service routine are provided in this section. 
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11.2.3.1 Context Switching 



Context switching is commonly required when processing a subroutine call or 
an interrupt. It may be quite extensive or simple, depending on system require- 
ments. On the TMS320C3x, the program counter is automatically pushed onto 
the stack. Important information in other TMS320C3x registers, such as the 
status, auxiliary, or extended-precision registers, must be saved by special 
commands. In order to preserve the state of the status register, it should be 
pushed first and popped last. This avoids the effects on the status register that 
result when the extended precision registers are restored. 

Example 11-4 and Example 11-5 show saving and restoring of the 
TMS320C3x state. In both examples, the stack is used for saving the registers, 
and it expands towards higher addresses. If you don't want to use the stack 
pointed at by SP, you can create a separate stack by using an auxiliary register 
as the stack pointer. Registers saved in these examples: 

Qi Extended-precision registers R7 through RO, 

□ Auxiliary registers AR7 through ARO, 

□ Data-page pointer DP, 

Qi Index registers IRO and IR1 , 
Qi Block-size register BK, 

□ Status register ST, 

Q Interrupt-related registers IE and IF, 

Q I/O flag IOF, 

□i Repeat-related registers RS, RE, and RC. 
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Example 1 1-4. Context-Save for the TMS320C3x 



TITLE CONTEXT-SAVE FOR THE TMS320C3:-: 



global 



SAVE 



* CONTEXT SAVE ON SUBROUTINE CALL OR INTERRUPT. 
SAVE: 



PUSH 



ST 



Save status register 



SAVE THE EXTENDED PRECISION REGISTERS 



PUSH RO 

PUSHF RO 

PUSH Rl 

PUSHF Rl 

PUSH R2 

PUSHF R2 

PUSH R3 

PUSHF R3 

PUSH R4 

PUSHF R4 

PUSH R5 

PUSHF R5 

PUSH R6 

PUSHF R6 

PUSH R7 

PUSHF R7 



Save 
Save 
Save 
Save 
Save 
Save 
Save 
Save 



the 
and 

the 
and 

the 
and 

the 
and 

the 
and 

the 
and 

the 
and 

the 
and 



lower 32 
the upper 

lower 32 
the upper 

lower 32 
the upper 

lower 32 
the upper 

lower 32 
the upper 

lower 32 
the upper 

lower 32 
the upper 

lower 32 
the upper 



bits of RO 

32 bits 
bits of Rl 

32 bits 
bits of R2 

32 bits 
bits of R3 

32 bits 
bits of R4 

32 bits 
bits of R5 

32 bits 
bits of R6 

32 bits 
.bits of R7 

32 bits 



SAVE THE AUXILIARY REGISTERS 



PUSH 


ARO 


PUSH 


AR1 


PUSH 


AR2 


PUSH 


AR3 


PUSH 


AR4 


PUSH 


AR5 


PUSH 


AR6 


PUSH 


AR7 



Save 


ARO 


Save 


AR1 


Save 


AR2 


Save 


AR3 


Save 


AR4 


Save 


AR5 


Save 


AR6 


Save 


AR7 



SAVE THE REST REGISTERS FROM THE REGISTER FILE 



Save data page pointer 

Save index register IRO 

Save index register IR1 

Save block— size register 

Save interrupt enable register 

Save interrupt flag register 

Save I/O flag register 

Save repeat start address 

Save repeat end address 

Save repeat counter 



PUSH 


DP 


PUSH 


IRO 


PUSH 


IR1 


PUSH 


BK 


PUSH 


IE 


PUSH 


IF 


PUSH 


IOF 


PUSH 


RS 


PUSH 


RE 


PUSH 


RC 



SAVE IS COMPLETE 
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Example 1 1-5. Context-Restore for the TMS320C3x 



TITLE CONTEXT-RESTORE FOR THE TMS320C3x 

.global RESTR 
CONTEXT RESTORE AT THE END OF A SUBROUTINE CALL OR INTERRUPT, 



RESTR: 



RESTORE 


THE R 


POP 


RC 


POP 


RE 


POP 


RS 


POP 


I OF 


POP 


IF 


POP 


IE 


POP 


BK 


POP 


IR1 


POP 


IRO 


POP 


DP 


RESTORE 


THE Al 


POP 


AR7 


POP 


AR6 


POP 


AR5 


POP 


AR4 


POP 


AR3 


POP 


AR2 


POP 


AR1 


POP 


ARO 



THE REST REGISTERS FROM THE REGISTER FILE 



Restore repeat counter 
Restore repeat end address 
Restore repeat start address 
Restore I/O flag register 
Restore interrupt flag register 
Restore interrupt enable register 
Restore block— size register 
Restore index register IR1 
Restore index register IRO 
Restore data page pointer 



THE AUXILIARY REGISTERS 



Restore 


AR7 


Restore 


AR6 


Restore 


AR5 


Restore 


AR4 


Restore 


AR3 


Restore 


AR2 


Restore 


AR1 


Restore 


ARO 



RESTORE THE -EXTENDED PRECISION REGISTERS 



Restore the upper 32 bits 

the lower 32 bits of 
Restore the upper 32 bits 

the lower 32 bits of 
Restore the upper 32 bits 

the lower 32 bits of 
Restore the upper 32 bits 

the lower 32 bits of 
Restore the upper 32 bits 

the lower 32 bits of 
Restore the upper 32 bits 

the lower 32 bits of 
Restore the upper 32 bits 

the lower 32 bits of 
Restore the upper 32 bits 

the lower 32 bits of 
Restore status register 



POPF 


R7 


POP 


R7 


POPF 


R6 


POP 


R6 


POPF 


R5 


POP 


R5 


POPF 


R4 


POP 


R4 


POPF 


R3 


POP 


R3 


POPF 


R2 


POP 


R2 


POPF 


Rl 


POP 


Rl 


POPF 


RO 


POP 


RO 


POP 


ST 


RESTORE 


IS C( 



and 
R7 

and 
R6 

and 
R5 

and 
R4 

and 
R3 

and 
R2 

and 
Rl 

and 
RO 



11-13 



Program Control 



11.2.3.2 Interrupt Priority 



Interrupts on the TMS320C3x are automatically prioritized. This allows inter- 
rupts that occur simultaneously to be serviced in a predefined order. Infre- 
quent, but lengthy, interrupt service routines may need to be interrupted by 
more frequently occurring interrupts. In Example 11-6, the interrupt service 
routine for INT2 temporarily modifies the interrupt enable register (IE) to permit 
interrupt processing when an interrupt to INTO (but no other interrupt) occurs. 
When the routine has finished processing, the IE register is restored to its origi- 
nal state. Notice that the RETIcond instruction not only pops the next program 
counter address from the stack, but also sets the GIE bit of the status register. 
This enables all interrupts that have their interrupt-enable bit set. 



Example 1 1-6. Interrupt Service Routine 



* TITLE INTERRUPT SERVICE ROUTINE 

* .global ISR2 
ENABLE .set 2 00 Oh 
MASK .set 1 

* INTERRUPT PROCESSING FOR EXTERNAL INTERRUPT INT2- 



ISR2 



PUSH 


ST 


PUSH 


DP 


PUSH 


IE 


PUSH 


R0 


PUSHF 


R0 


PUSH 


Rl 


PUSHF 


Rl 


LDI 


MASK, IE 


OR 


ENABLE, ST 



Save status register 

Save data page pointer 

Save interrupt enable register 

Save lower 32 bits and 

upper 32 bits of R0 
Save lower 32 bits and 

upper 32 bits of Rl 
Unmask only INTO 
Enable all interrupts 



MAIN PROCESSING SECTION FOR ISR2 



XOR 


ENABLE, ST 


POPF 


Rl 


POP 


Rl 


POPF 


R0 


POP 


R0 


POP 


IE 


POP 


DP 


POP 


ST 



RET I 



Disable all interrupts 
Restore upper 32 bits and 

lower 32 bits of Rl 
Restore upper 32 bits and 

lower 32 bits of R0 
Restore interrupt enable register 
Restore data page register 
Restore status register 

Return and enable interrupts 
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11.2.4 Delayed Branches 



The TMS320C3x uses delayed branches to create single-cycle branching. The 
delayed branches operate like regular branches but do not flush the pipeline. 
Instead, the three instructions following a delayed branch are also executed. 
As discussed in Chapter 6, Program Flow Control, the only limitations are that 
none of the three instructions following a delayed branch can be a 

Qi Branch (standard or delayed), 

L} Call to a subroutine, 

Ql Return from a subroutine, 

Qi Return from an interrupt, 

Q Repeat instruction, 

£1 TRAP instruction, 

Q IDLE instruction. 

Conditional delayed branches use the conditions that exist at the end of the in- 
struction immediately preceding the delayed branch. Sometimes, a branch is 
necessary in the flow of a program, but fewer than three instructions can be 
placed after a delayed branch. For faster execution, it is still advantageous to 
use a delayed branch. This is shown in Example 11-7, with NOPs taking the 
place of the unused instructions. The trade-off is more instruction words for 
less execution time. 



Example 11-7. Delayed Branch Execution 

* TITLE DELAYED BRANCH EXECUTION 



LDF 


*+ARl(5) , 


BGED 


SKIP 


LDFN 


R2,R1 


SUBF 


3.0,R1 


NOP 





Load contents of memory to R2 
If loaded number >=0, branch (delayed) 
If loaded number <0, load it to Rl 
Subtract 3 from Rl 

Dummy operation to complete delayed 
; branch 
MPYF 1.5,R1 ; Continue here if loaded number <0 



SKIP LDF R1,R3 ; Continue here if loaded number >=0 
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11.2.5 Repeat Modes 



The TMS320C3x supports looping without any overhead. For that purpose, 
there are two instructions: RPTB repeats a block of code, and RPTS repeats 
a single instruction. There are three control registers: RS (repeat start ad- 
dress), RE (repeat end address), and RC (repeat counter). These contain the 
parameters that specify loop execution (refer to Section 7.1 for a complete de- 
scription of RPTB and RPTS). RS and RE are automatically set from the code, 
while RC must be set by the user, as shown in the examples below. 



11.2.5.1 Block Repeat 



Example 1 1 -8 shows an application of the block repeat construct. In this exam- 
ple, an array of 64 elements is flipped over by exchanging the elements that 
are equidistant from the end of the array. In other words, if the original array 
is 

a(1), a(2) a(31),a(32) a(64); 

the final array after the rearrangement will be 

a(64),a(63) a(32),a(31) a(1). 

Because the exchange operation is done on two elements at the same time, 
it requires 32 operations. The repeat counter RC is initialized to 31 . In general, 
if RC contains the number N, the loop will be executed N +1 times. The loop 
is defined by the RPTB instruction and the EXCH label. 
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Example 1 1-8. Loop Using Block Repeat 



TITLE LOOP USING BLOCK REPEAT 

THIS CODE SEGMENT EXCHANGES THE VALUES OF ARRAY ELEMENTS THAT ARE 
SYMMETRIC AROUND THE MIDDLE OF THE ARRAY. 



EXCH 



LDI 
LDI 
AD.DI 

LDI 

RPTB 

LDI 
LDI 
STI 
STI 



@ADDR,AR0 

AR0,AR1 

63,AR1 

31, RC 
EXCH 



; ARO points to the beginning of the array 

; AR1 points to the end of the 

; 64— element array 

; Initialize repeat counter 



; Repeat RC+1 times between here and 
EXCH 
*AR0,R0 ; Load one memory element in RO, 
*AR1,R1 ; and the other in Rl 
Rl, *AR0++ (1) ; Then, exchange their locations 
R0,*AR1--(1) 



Subsection 6.1 .2 specifies restrictions in the block-repeat construct. Because 
the program counter is modified at the end of the loop according to the contents 
of the registers RS, RE, and RC, no operation should attempt to modify the re- 
peat counter or the program counter at the end of the loop in a different way. 

In principle, it is possible to nest repeat blocks. However, there is only one set 
of control registers: RS, RE, and RC. It is therefore necessary to save these 
registers before entering an inside loop. It may be more economical to imple- 
ment a nested loop by the more traditional method of using a register as a 
counter and then using a delayed branch rather than using the nested repeat 
block approach.^ 

Example 11-9 shows another example of using the block repeat to find a maxi- 
mum of 147 numbers. 
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Example 1 1-9. Use of Block Repeat to Find a Maximum 



TITL USE OF BLOCK REPEAT TO FIND A MAXIMUM 
THIS ROUTINE FINDS THE MAXIMUM OF N=14 7 NUMBERS 



LOOP 



LDI 146, RC ; Initialize repeat counter to 147-1 

LDI @ADDR,AR0 ; ARO points to the beginning of the array 

LD *AR0++(1) , RO; Initialize MAX to the first value 

RPTB LOOP 

CMPF *AR0++ (1) , RO ; Compare number to the maximum 

LDFLT *-AR0(l),R0 / If greater, this is a new maximum 



11.2.5.2 Single-Instruction Repeat 



The single-instruction repeat uses the control registers RS, RE, and RC in the 
same way as the block repeat. The advantage over the block repeat is that the 
instruction is fetched only once, and then the buses are available for moving 
operands. One difference to note is that the single-instruction repeat construct 
is not interruptible, while block repeat is interruptible. 

Example 11-10 shows an application of the repeat-single construct. In this ex- 
ample, the sum of the products of two arrays is computed. The arrays are not 
necessarily different. If the arrays are a(i) and b(i), each of length N = 51 2, reg- 
ister RO will contain, after computation, this quantity: 

a (1) b (1) + a (2) b (2) +...+ a (N) b (N). 

The value of the repeat counter (RC) is specified to be 511 in the instruction. 
If RC contains the number N, the loop will be executed N +1 times. 
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Example 11-10. Loop Using Single Repeat 

* TITL LOOP USING SINGLE REPEAT 



THIS CODE SEGMENT COMPUTES SUM [a (i) b (i) ] FOR i = 1 to N 



LDI @ADDR1,AR0 ; ARO points to array a(i) 

LDI @ADDR2,AR1 ; AR1 points to array b(i) 
* 

LDF 0.0, RO ; Initialize R0 
* 

MPYF3 *AR0++(1) , *AR1++(1) , Rl 
* ; Compute first product 

RPTS 511 ; Repeat 512 times 

* 

MPYF3 *AR0++(1) , *AR1++(1) ,R1,R0 ; Compute next product 

J | ADDF3 R1,R0,R0 ; and accumulate the previous one 

ADDF R1,R0 ; One final addition 
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11.2.6 Computed GOTO's 



Occasionally, it is convenient to select during runtime, and not during assem- 
bly, the subroutine to be executed. The TMS320C3x's computed GOTO sup- 
ports this selection. The computed GOTO is implemented using the CALLcond 
instruction in the register addressing mode. This instruction uses the contents 
of the register as the address of the call. Example 11-11 shows the case of 
a task controller. 



Example 11-11. Computed GOTO 



TITL COMPUTED ' GOTO 

TASK CONTROLLER 

THIS MAIN ROUTINE CONTROLS THE ORDER OF TASK EXECUTION (6 TASKS 
IN THE PRESENT EXAMPLE) . TASKO THROUGH TASK5 ARE THE NAMES OF 
SUBROUTINES TO BE CALLED. THEY ARE EXECUTED IN ORDER, TASKO, 
TASK1, . . .TASK5. WHEN AN INTERRUPT OCCURS, THE INTERRUPT 
SERVICE ROUTINE IS EXECUTED, AND THE PROCESSOR CONTINUES 
WITH THE INSTRUCTION FOLLOWING THE IDLE INSTRUCTION. THIS 
ROUTINE SELECTS THE TASK APPROPRIATE FOR THE CURRENT CYCLE, 
CALLS THE TASK AS A SUBROUTINE, AND BRANCHES BACK TO THE IDLE 
TO WAIT FOR THE NEXT SAMPLE INTERRUPT WHEN THE SCHEDULED TASK 
HAS COMPLETED EXECUTION. RO HOLDS THE OFFSET FROM THE BASE 
ADDRESS OF THE TASK TO BE EXECUTED. 



WAIT 



TSKSEQ 



ADDR 



LDI 


5 , RO 


LDI 


@ADDR,AR1 


IDLE 




ADD 1 3 


*AR1 , RO , AR2 / 


SUBI 


1 , RO ; 


LDILT 


5 , RO 


LDI 


*AR2 , Rl : 


CALLU 


Rl 


BR 


WAIT 


.word 


TASK5 


.word 


TASK4 


.word 


TASK3 


.word 


TASK2 


.word 


TASK1 


.word 


TASKO 


.word 


TSKSEQ 



Initialize RO 

AR1 holds the base address of the table 

Wait for the next interrupt 

Add the base address to the table 

Entry number 

Decrement RO 

If R0<0, reinitialize it to 5 

Load the task address 

Execute appropriate task 



Address 
Address 
Address 
Address 
Address 
Address 



of TASK5 
of TASK4 
of TASK3 
of TASK2 
of TASK1 
of TASKO 
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11.3 Logical and Arithmetic Operations 



The TMS320C3x instruction set supports both integer and floating-point arith- 
metic and logical operations. The basic functions of such instructions can be 
combined to form more complex operations. This section examines examples 
of these operations: 

Ql Bit manipulation 

Ql Block moves 

Q Bit-reversed addressing 

□ Integer and floating-point division 

Q Square root 

Ql Extended-precision arithmetic 

Ql Floating-point format conversion between IEEE and TMS320C3x formats 



11.3.1 Bit Manipulation 



Instructions for logical operations, such as AND, OR, NOT, ANDN, and XOR, 
can be used together with the shift instructions for bit manipulation. A special 
instruction, TSTB, tests bits. TSTB does the same operation as AND, but the 
result of the logical AND is used only to set the condition flags and is not written 
anywhere. Example 11-12 and 11-13 demonstrate the use of the several in- 
structions for bit manipulation and testing. 



Example 1 1-12. Use of TSTB for Software-Controlled Interrupt 



TITLE USE OF TSTB, FOR SOFTWARE-CONTROLLED INTERRUPT 

IN THIS EXAMPLE, ALL INTERRUPTS HAVE BEEN DISABLED BY 
RESETTING THE GIE BIT OF THE STATUS REGISTER. WHEN AN 
INTERRUPT ARRIVES, IT IS STORED IN THE IF REGISTER. THE 
PRESENT EXAMPLE ACTIVATES THE INTERRUPT SERVICE ROUTINE INTR 
WHEN IT DETECTS THAT INT2- HAS OCCURRED. 



TSTB 0100b, IF ; Check if bit 2 of IF is set, 
CALLNZ INTR / and, if so, call subroutine INTR 
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Example 11-13. Copy a Bit From One Location to Another 



T.ITLE COPY A BIT FROM ONE LOCATION TO ANOTHER 

BIT I OF Rl NEEDS TO BE COPIED TO BIT J OF R2 . 

ARO POINTS TO A LOCATION HOLDING I, AND IT IS ASSUMED THAT THE 

NEXT MEMORY LOCATION HOLDS THE VALUE J. 



Rl 



R2 



*AR0 



* (ARO+1) 



LDI 


1,R0 


LSH 


*AR0 , RO 


TSTB 


R1,R0 


BZD 


CONT 


LDI 


1,R0 


LSH 


*+AR0(l) ,R0 


ANDN 


R0,R2 


OR 


R0,R2 



Shift 1 to align it with bit I 
Test the I-th bit of Rl 
If bit = 0, branch delayed 

Align 1 with J— th location 

If bit = 0, reset J-th bit of R2 

If bit = 1, set J-th bit of R2 



CONT 
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11.3.2 Block Moves 



Since the TMS320C3x directly addresses a large amount of memory, blocks 
of data or program code can be stored off-chip in slow memories and then 
loaded on-chip for faster execution. Data can also be moved from on-chip to 
off-chip for storage or for multiprocessor data transfers. 

Such data transfers can be accomplished efficiently in parallel with CPU opera- 
tions, using the DMA. The DMA operation is explained in detail in subsection 
8.3 on page 8-38. An alternative to DMA is to perform data transfers under pro- 
gram control using load and store instructions in a repeat mode. 
Example 11-14 shows the transfer of a block of 512 floating-point numbers 
from external memory to block 1 of the on-chip RAM. 



Example 11-14. Block Move Under Program Control 



TITLE BLOCK MOVE UNDER PROGRAM CONTROL 



extern 


.word 


01000H 


blockl 


.word 


0809COOH 




LDI 


@extern,ARO ; 




LDI 


@blockl,ARl ; 




LDF 


*AR0++,R0 




RPTS 


510 




LDF 


*AR0++,R0 


I I 


STF 


R0,*AR1++ 




STF 


R0,*AR1 



Source address 

Destination address 

Load the first number 

Repeat following instruction 511 times 

Load the next number, and. . . 

store the previous' one 

Store the last number 



11.3.3 Bit-Reversed Addressing 



The TMS320C3x can implement Fast Fourier Transforms (FFT) with bit-rev- 
ersed addressing. If the data to be transformed is in the correct order, the final 
result of the FFT is scrambled in bit-reversed order. To recover the frequency- 
domain data in the correct order, certain memory locations must be swapped. 
The bit-reversed addressing mode makes swapping unnecessary. The next 
time data needs to be accessed, the access is done in a bit-reversed manner 
rather than sequentially. The base address of bit-reversed addressing must be 
located on a boundary of the size of the table. For example, if IRO = 2 n_1 , the 
n LSBs of the base address must be zero (0). 

In bit-reversed addressing, IRO holds a value equal to one-half the size of the 
FFT, if real and imaginary data are stored in separate arrays. During accessing, 
the auxiliary register is indexed by IRO, but with reverse carry propagation. 
Example 11-15 illustrates a 512-point complex FFT being moved from the 
place of computation (pointed at by ARO) to a location pointed at by AR1 . In 
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this example, real and imaginary parts XR(i) and Xl(i) of the data are not stored 
in separate arrays, but they are interleaved XR(0), Xl(0), XR(1), Xl(1), ..., 
XR(N-1), XI(N-1). Because of this arrangement, the length of the array is 2N 
instead of N, and IRO is set to 51 2 instead of 256. 



Example 11-15. Bit-Reversed Addressing 



TITLE BIT-REVERSED ADDRESSING 

THIS EXAMPLE MOVES THE RESULT OF THE 512-POINT FFT 
COMPUTATION, POINTED AT BY ARO, TO A LOCATION POINTED AT 
BY AR1. REAL AND IMAGINARY POINTS ARE ALTERNATING. 



Repeat 511+1 times 

Load first imaginary point 

Load real value (and point 
to next location) and store 
the imaginary value 
LOOP LDF *+AR0(l),Rl / Load next imaginary point and store 

previous real value 



LDI 


512, IRO 


LDI 


2,IR1 


LDI 


511, RC 


LDF 


*+AR0 (1) ,R1 


RPTB 


LOOP 


LDF 


*AR0++(IR0)B,R0 


STF 


R1,*+AR1(1) 


LDF 


*+AR0(l) ,R1 


STF 


R0,*AR1++(IR1) 



11.3.4 Integer and Floating-Point Division 



Although division is not implemented as a single instruction in the TMS320C3x, 
the instruction set has the capacity to perform an efficient division routine. Inte- 
ger and floating-point division are examined separately because different algo- 
rithms are used. 

11.3.4. 1 Integer Division 

Division is implemented on the TMS320C3x by repeated subtractions using 
SUBC, a special conditional subtract instruction. Consider the case of a 32-bit 
positive dividend with i significant bits (and 32 - i sign bits), and a 32-bit posi- 
tive divisor with j significant bits (and 32 - j sign bits). The repetition of the 
SUBCcommandi-j +1 timesproducesa32-bitresultwheretheloweri-j +1 
bits are the quotient, and the upper 31 - i + j bits are the remainder of the divi- 
sion. 

SUBC implements binary division in the same manner as long division. The di- 
visor (assumed to be smaller than the dividend) is shifted left i - j times to be 
aligned with the dividend. Then, using SUBC, the shifted divisor is subtracted 
from the dividend. For each subtract that does not produce a negative answer, 

11-24 Software Applications 



Logical and Arithmetic Operations 



the dividend is replaced by the difference. It is then shifted to the left, and a one 
is put in the LSB. If the difference is negative, the dividend is simply shifted left 
by one. This operation is repeated i - j +1 times. 

As an example, consider the division of 33 by 5 using both long division and 
the SUBC method. In this case, i = 6, j = 3, and the SUBC operation is re- 
peated 6 - 3 +1 = 4 times. 



LONG DIVISION: 



00000000000000000000000000000101 



00000000000000000000000000000110 



00000000000000000000000000100001 
-101 
1101 
-101 



11 



Quotient 



Remainder 



SUBC METHOD: 



00000000000000000000000000100001 
000000000000000000000000001 01 000 

Negative difference 

000000000000000000000000001 0001 
000000000000000000000000001 01 000 

00000000000000000000000000011 01 

I 

0000000000000000000000000011 01 01 
000000000000000000000000001 01 000 

00000000000000000000000000001101 

I 

00000000000000000000000000011 01 1 
000000000000000000000000001 01 000 

Negative difference 

I 

0000000000000000000000000011 01 1 



I 

Remainder 



Quot. 



Dividend 
Divisor (aligned) 
(1st SUBC command) 



New Dividend + Quotient 

Divisor 

Difference (>0) (2nd SUBC command) 



New Dividend + Quotient 

Divisor 

Difference (>0) (3rd SUBC command) 



New Dividend + Quotient 

Divisor 

(4th SUBC command) 



Final Result 



When the SUBC command is used, both the dividend and the divisor must be 
positive. Example 11-16 shows a realization of the integer division in which the 
sign of the quotient is properly handled. The last instruction before returning 
modifies the condition flag in case subsequent operations depend on the sign 
of the result. 
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Example 11-16. Integer Division 



TITLE INTEGER DIVISION 
SUBROUTINE DIVI 



INPUTS: 



SIGNED INTEGER DIVIDEND IN RO, 
SIGNED INTEGER DIVISOR IN Rl . 



OUTPUT: R0/R1 into RO . 
REGISTERS USED: R0-R3, IRO, IR1 



OPERATION: 



CYCLES : 



1. NORMALIZE DIVISOR WITH DIVIDEND 

2. REPEAT SUBC 

3. QUOTIENT IS IN LSBs OF RESULT 

31-62 (DEPENDS ON AMOUNT OF NORMALIZATION) 





.globl 


DIVI 


SIGN 


. set 


R2 


TEMPF 


. set 


R3 


TEMP 


. set 


IRO 


COUNT 


. set 


IR1 



DIVI - SIGNED DIVISION 



DIVI: 



DETERMINE SIGN OF RESULT. GET ABSOLUTE VALUE. OF OPERANDS, 



XOR R0,R1,SIGN 
ABSI RO 
ABSI Rl 



CMP I 
BGTD 



R0,R1 
ZERO 



Get the sign 



Divisor > dividend ? 
If so, return 



NORMALIZE OPERANDS. USE DIFFERENCE IN EXPONENTS AS SHIFT COUNT 
FOR DIVISOR, AND AS REPEAT COUNT FOR 'SUBC . 



FLOAT 
PUSHF 
POP 
LSH 

FLOAT 

PUSHF 

POP 

LSH 

SUBI 

LSH 



RO, TEMPF 
TEMPF 
COUNT 
-24, COUNT 

Rl, TEMPF 
TEMPF 
TEMP 

-24, TEMP 
TEMP, COUNT 
COUNT, Rl 



Normalize dividend 

PUSH as float 

POP as int 

Get dividend exponent 

Normalize divisor 

PUSH as float 

POP as int 

Get divisor exponent 

Get difference in exponents 

Align divisor with dividend 



DO COUNT+1 SUBTRACT & SHIFTS. 
RPTS COUNT 
SUBC R1,R0 
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MASK OFF THE LOWER COUNT+1 BITS OF RO 



SUBRI 31, COUNT ; Shift count is (32 - (COUNT+1)) 

LSH COUNT, RO ; Shift left 

NEGI . COUNT 

LSH COUNT, RO ; Shift right to get result 

* CHECK SIGN AND NEGATE RESULT IF NECESSARY. 

NEGI - R0,R1 ; Negate result 

ASH -31, SIGN ; Check sign 

LDINZ R1,R0 ; If set, use negative result 

CMPI 0,R0 ; Set status from result 
RETS 
*• 

* RETURN ZERO. 

ZERO: 

LDI 0,R0 

RETS 

.end 

If the dividend is less than the divisor and you want fractional division, you can 
perform a division after you determine the desired accuracy of the quotient in 
bits. If the desired accuracy is k bits, start by shifting the dividend left by k posi- 
tions. Then apply the algorithm described above, where i should now be re- 
placed by i + k. It is assumed that i + k is less than 32. 

11.3.4.2 Computation of Floating-Point Inverse and Division 

This section presents a method of implementing floating-point division on the 
TMS320C3x. Since the algorithm outlined here computes the inverse of a 
number v, to divide y/v, multiply y by the inverse of v. 

The computation of 1 /v is based on the following iterative algorithm. At the i-th 
iteration, the estimate x [ i ] of 1/v is computed from v, and the previous esti- 
mate x[i-1] according to the formula: 

x[i] = x[i-1]*(2.0-v*x[i-1]) 

To start the operation, an initial estimate x [0] is needed. If v=a*2©, a good initial 
estimate is 

x [0] = 1 .0 * 2 ~ e - 1 

Example 11-17 shows the implementation of this algorithm on the 
TMS320C3x, where the iteration has been applied 5 times. Both accuracy and 
speed are affected by the number of iterations. The accuracy offered by the 
single-precision floating-point format is 2 - 23 = 1 .1 92E - 7. If you want more 
accuracy, use more iterations. If less accuracy is acceptable, reduce the num- 
ber of iterations to increase the execution speed of this implementation. 

This algorithm properly treats the boundary conditions when the input number 
either is zero or has a very large value. When the input is zero, the exponent 
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e = -128. Then the calculation of x [0] yields an exponent equal to 
- (- 1 28) - 1 =1 27, and the algorithm will overflow and saturate. On the other 
hand, in the case of a very large number, e = 1 27, the exponent of x [0] will be 
-127-1 = - 128. This will cause the algorithm to yield zero, which is a rea- 
sonable handling of that boundary condition. 
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Example 11-17. Inverse of a Floating-Point Number 

* 

* TITL INVERSE OF A FLOATING-POINT NUMBER 

* SUBROUTINE INVF 

* THE FLOATING-POINT NUMBER v IS STORED IN RO . AFTER THE 

* COMPUTATION IS COMPLETED, 1/v IS ALSO STORED IN RO . 

* TYPICAL CALLING SEQUENCE: 

* LDF v, RO 

* CALL INVF 
* 

* ARGUMENT ASSIGNMENTS: 

* ARGUMENT | FUNCTION 

*■ + 

* RO I V = NUMBER TO FIND THE RECIPROCAL OF (UPON THE CALL) 

* RO | 1/v (UPON THE RETURN) 
* 

* REGISTER USED AS INPUT: RO 

* REGISTERS MODIFIED: RO, Rl, R2, R3 

* REGISTER CONTAINING RESULT:. RO 
* 

* CYCLES: 35 WORDS: 32 

.global INVF 

INVF: LDF R0,R3 ; v is saved for later. 

ABSF RO ; The algorithm uses v = |v|. 

* EXTRACT THE EXPONENT OF v. 

PUSHF RO 
POP Rl 

ASH -24, Rl ; The 8 LSBs of Rl contain the exponent 

* ; of v. 

* x[0] FORMATION GIVEN THE EXPONENT OF v. 

NEGI Rl 

SUBI 1,R1 ; Now we have -e-1, the exponent of x[01. 
ASH 24, Rl 
PUSH Rl 

POPF Rl ; Now Rl = x[0] = 1.0 * 2**(-e-l). 
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* NOW THE ITERATIONS BEGIN. 

MPYF R1,R0,R2 

SUBRF 2 . , R2 

MPYF R2,R1 

MPYF R1,R0,R2 

SUBRF 2 . , R2 

MPYF R2,R1 
* 

MPYF R1,R0,R2 

SUBRF 2 . , R2 

MPYF R2 , Rl 

MPYF R1,R0,R2 

SUBRF 2 . , R2 

MPYF R2,R1 



R2 = v * x[0] 

R2=2.0-v*x[0] 

Rl = x[l] = x[0] * (2.0 - v 



:[0]) 



R2 = v * x[l] 

R2 = 2.0 - v * x[l] 

Rl = x[2] = x[l] * (2.0 - v * x[l]) 



R2 = v * x[2] 

R2 = 2.0 - v * x[2] 

Rl = x[3] = x[2] * (2.0 



v ■* x[2] ) 



RND 



Rl 



R2 = v * x[3] 

R2 = 2.0 - v * x[3] 

Rl = x[4] = x[3] * (2.0 - v * x[3]) 

This minimizes error in the LSBs . 



FOR THE LAST ITERATION WE USE THE FORMULATION: 
x[5] = (x[4] * (1.0 - (v * x[4]))) + x[4] 



MPYF R1,R0,R2 

SUBRF 1 . , R2 

MPYF R1,R2 

ADDF R2,R1 



RND 



R1,R0 



R2 = v * x[4] = 1.0. .01. . => 1 

R2 = 1.0 - v * x[4] = 0.0. .01... => 

R2 = x[4] * (1.0 - v * x[4] ) 

R2 = x[5] = (x[4]*(1.0-(v*x[4])))+x[4] 

Round since this is follow by a MPYF. 



* NOW THE CASE OF v < IS HANDLED, 



NEGF R0,R2 

LDF R3,R3 

LDFN R2 , R0 

RETS 



; This sets condition flags 
; If v < 0, then R0 = -R0 



END 



11.3.5 Square Root 



, end 



An iterative algorithm computes square root on the TMS320C3x and is similar 
to the one used for the computation of the inverse. This algorithm computes 
the inverse of the square root of a number v, 1/SQRT(v). To derive SQRT(v), 
multiply this result by v. Since in many applications, division by the square root 
of a number is desirable, the output of the algorithm saves the effort to compute 
the inverse of the square root. 

At the i-th iteration, the estimate x[ i ] of 1/SQRT(v) is computed from v and the 
previous estimate x[i-1] according to this formula: 

x [ i ] = x [i - 1 ] * (1 .5 - (v/2) * x [i - 1 ] * x [I - 1 ]) 
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To start the operation, an initial estimate x[0] is needed. If v = a*2 e , a good ini- 
tial estimate is 

x[0] = 1.0*2-e/2 

Example 11-18 shows the implementation of this algorithm on the 
TMS320C3x, where the iteration has been applied 5 times. Both accuracy and 
speed are affected by the number of iterations. If you want more accuracy, use 
more iterations. If less accuracy is acceptable, reduce the number of iterations 
to increase the execution speed of this implementation. 
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Example 11-18. Square Root of a Floating-Point Number 

* 

* TITLE SQUARE ROOT OF A FLOATING-POINT NUMBER 

* SUBROUTINE SQRT 
* 

* THE FLOATING POINT NUMBER v IS STORED IN RO . AFTER THE 

* COMPUTATION S COMPLETED, SQRT (v) IS ALSO STORED IN RO . NOTE 

* THAT THE ALGORITHM ACTUALLY COMPUTES l/SQRT(v) . 

* TYPICAL CALLING SEQUENCE: 

* LDF v, RO 

* CALL SQRT 

* ARGUMENT ASSIGNMENTS: 

* ARGUMENT | FUNCTION 

* __ + 

* RO I v = NUMBER TO FIND THE SQUARE ROOT OF 
*' I (UPON THE CALL) 

* RO | SQRT(v) (UPON THE RETURN) 

* REGISTER USED AS INPUT: RO 

* REGISTERS MODIFIED: RO, Rl, R2, R3 
. * REGISTER CONTAINING RESULT: RO 

* 

* CYCLES: 50 WORDS: 39 
* 

.global SQRT 

* EXTRACT THE EXPONENT OF V. 

SQRT: LDF R0,R3 / Save v 

; Return if number non— positive 

; The 8 LSBs of Rl contain the exponent of v, 
; Add a rounding bit in the exponent 
; e/2 



LDF 


R0,R3 


RETSLE 




PUSHF 


RO 


POP 


Rl 


ASH 


-24, Rl 


ADD I 


1, Rl 


ASH 

7 1 Pi "DMA T>T 


-1,R1 

PiM CT\TT? 


? UrsJL v !/i 1 X 

NEGI 


Rl 


ASH 


24, Rl 


PUSH 


Rl 


POPF 


Rl 



; Now Rl = >:[0] = 1.0 * 2** (-e/2) . 

* GENERATE V/2. 
* 

MPYF 0.25,R0 ; V/2 and take rounding bit out 

* NOW THE ITERATIONS BEGIN. 
* 

MPYF R1,R1,R2 ; R2 = x[0] * x[0] 

MPYF R0,R2 ; R2 = (v/2) * :-:[0] * :-:[0] 

SUBRF 1.5,R2 ; R2 = 1.5 - (v/2) * x[0] * x[0] 
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MPYF R2,R1 



RND 


Rl 


MPYF 


R1,R1,R2 


MPYF 


R0,R2 


SUBRF 


1.5,R2 


MPYF 


R2,R1 


RND 


Rl 


MPYF 


R1,R1,R2 


MPYF 


R0,R2 


SUBRF 


1 . 5 , R2 


MPYF 


R2,R1 



RND 

MPYF 
MPYF 
SUBRF 
MPYF 

RND 



Rl 

R1,R1,R2 
R0,R2 
1.5,R2 
R2,R1 

Rl 



Rl = x[l] = x[0] * 

(1.5 - (v/2)*x[0]*x[0]) 

R2 = x[l] * x[l] 

R2 = (v/2) * x[l] * x[l] 

R2 = 1.5 - (v/2) * x[l] * x[l] 

Rl = x[2] = x[l] * 

(1.5 - (v/2)*x[l]*x[l]) 

R2 = x[2] * x[2] 

R2 = (v/2) * x[2] * x[2] 

R2 = 1.5 - (v/2) * x[2] * x[2] 

Rl = x[3] = x[2] 

* (1.5 - (v/2)*x[2]*x[2] ) 



R2 = x[3] * x[3] 

R2 = (v/2) * x[3] * x[3] 

R2 = 1.5 - (v/2) * x[3] * x[3] 

Rl = x[4] = x[3] 

* (1.5 - (v/2)*x[3]*x[3]) 



MPYF 


. R1,R1,R2 


MPYF 


R0,R2 


SUBRF 


1 . 5 , R2 


MPYF 


R2,R1 



R2 = x[4] * x[4] 

R2 = (v/2) * x[4] * x[4] 

R2 = 1.5 - (v/2) * x[4] * x[4] 

Rl = x[5] = x[4] 

* (1.5 - (v/2)*x[4]*x[4] ) 



RND 


R1,R0 


MPYF 


R3,R0 


RETS 





Round 

Sqrt (v) from sjqrt (v**(-D ) 



end 



end 
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11.3.6 Extended-Precision Arithmetic 



The TMS320C3x offers 32 bits of precision for integer arithmetic, and 24 bits 
of precision in the mantissa for floating-point arithmetic. For higher precision 
in floating-point operations, the eight extended-precision registers R7 to RO 
contain eight additional bits of accuracy. Since no comparable extension is 
available for fixed-point arithmetic, this section discusses how fixed-point 
double precision can be achieved by using the capabilities of the processor. 
The technique consists of performing the arithmetic by parts, similar to the way 
in which longhand arithmetic is done. 

In the instruction set, operations ADDC (Add with Carry) and SUBB (Subtract 
with Borrow) use the status carry bit for extended-precision arithmetic. The 
carry bit is affected by the arithmetic operations of the ALU and by the rotate 
and shift instructions. It can also be manipulated directly by setting the status 
register to certain values. For proper operation, the overflow mode bit should 
be reset (OVM = 0) so the accumulator results will not be loaded with the satu- 
ration values. Example 11-19 and Example 11-20 show 64-bit addition and 
64-bit subtraction. The first operand is stored in the registers RO (low word) and 
R1 (high word). The second operand is stored in R2 and R3, respectively. The 
result is stored in RO and R1 . 
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Example 11-19. 64-Bit Addition 

* TITLE 64-BIT ADDITION 



TWO 64-BIT NUMBERS ARE ADDED TO EACH OTHER PRODUCING 
A 64-BIT RESULT. THE NUMBERS X (Rl,R0) AND Y (R3,R2) 
ADDED, RESULTING IN W (R1,R0). 

Rl RO 
+ R3 R2 



Rl RO 



ADDI R2,R0 

ADDC R3,R1 

Example 11-20. 64-Bit Subtraction 

* TITLE 64-BIT SUBTRACTION 



TWO 64-BIT NUMBERS ARE SUBTRACTED FROM EACH OTHER 
PRODUCING A 64-BIT RESULT. THE NUMBERS X (R1,R0) AND 
Y (R3,R2) ARE SUBTRACTED, RESULTING IN W (Rl, RO) . 

Rl RO 
+ R3 R2 



Rl RO 



SUBI R2,R0 
SUBB R3,R1 

When two 32-bit numbers are multiplied, a 64-bit product results. The proce- 
dure for multiplication is to split the 32-bit magnitude values of the multiplicand 
X and the multiplier Y into two parts (X1 ,X0) and (X3,X2), respectively, with 1 6 
bits each. The operation is done on unsigned numbers, and the product is ad- 
justed for the sign bit. Example 11-21 shows the implementation of a 32-bit by 
32-bit multiplication. 
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Example 11-21. 32-Bit by 32-Bit Multiplication 



* TITLE 32 X 32 BIT MULTIPLICATION 
* 

* 

* SUBROUTINE EXTMPY 
* 

* FUNCTION: TWO 32-BIT NUMBERS ARE MULTIPLIED, PRODUCING A 64-BIT 

* RESULT. THE TWO NUMBERS (X and Y) ARE EACH SEPARATED INTO TWO 

* PARTS (XI XO) AND (Yl YO) , WHERE XO, XI, YO, AND Yl ARE 16 BITS. 

* THE TOP BIT IN XI AND Yl IS THE SIGN BIT. THE PRODUCT IS 

* IN TWO WORDS (WO AND Wl) . THE MULTIPLICATION IS PERFORMED ON 

* POSITIVE NUMBERS, AND THE SIGN IS DETERMINED AT THE END. 

* XI XO BITS OF PRODUCTS 

* X Yl YO (NC 



X0*Y0 
X0*Y1 
X1*Y0 
X1*Y1 

Wl WO 



ARGUMENT ASSIGNMENTS; 
ARGUMENT | FUNCTION 



HUNTING SIGN) 


PRODUCT 


16+16 


PI 


16+16 


P2 


16+16 


P3 


16+16 


P4 



* RO | MULTIPLIER AND LOW WORD OF THE PRODUCT 

* Rl | MULTIPLICAND AND UPPER WORD OF THE PRODUCT 

* REGISTERS USED AS INPUT: RO, Rl 

* REGISTERS MODIFIED: RO, Rl, R2, R3, R4, ARO, AR1, 

* REGISTER CONTAINING RESULT: R0,R1 

* CYCLES: 28 (WORST CASE) WORDS: 25 

.global EXTMPY 

EXTMPY XOR3 R0,R1,AR0 ; Store sign 

ABSI RO ; Absolute values of X 

ABSI Rl ; and Y 

* SEPARATE MULTIPLIER AND MULTIPLICAND INTO TWO PARTS 
* 

LDI -16,AR1 

LSH3 AR1,R0,R2 ; R2 = XI = Upper 16 bits of X 

AND 0FFFFH,R0 ; RO = XO = Lower 16 bits of X 

LSH3 AR1,R1,R3 ; R3 = Yl = Upper 16 bits of Y 

AND 0FFFFH,R1 ; Rl = YO = Lower 16 bits of Y 
* 

* CARRY OUT THE MULTIPLICATION 

MPYI3 R0,R1,R4 / X0*Y0 = PI 
MPYI R3,R0 ; X0*Y1 = P2 
MPYI R2,R1 ; X1*Y0 = P3 
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DONE 



ADD I 


R0,R1 


MPYI 


R2,R3 


LDI 


R1,R2 


LSH 


16,R2 


CMP I 


0,AR0 


BGED 


DONE 


LSH 


-16, Rl 


ADD 1 3 


R4,R2,R0 


ADDC3 


R1,R3,R1 


2 THE 


PRODUCT IF 


NOT 


RO 


ADD I 


1,R0 


NOT 


Rl 


ADDC 


0,R1 


RETS 




.end 





P2+P3 
X1*Y1 = P4 



Lower 16 bits of P2+P3 

Check the sign of the product 

If >0, multiplication complete (delayed) 

Upper 16 bits of P2+P3 

WO = RO = Lower word of the product 

Wl = Rl = Upper word of the product 
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11.3.7 Floating-Point Format Conversion: IEEE to/from TMS320C3x 

In fixed-point arithmetic, the binary point that separates the integer from the 
fractional part of the number is fixed at a certain location. For example, if a 
32-bit number has the binary point after the most significant bit (which is also 
the sign bit), only fractional numbers (numbers with absolute values less than 
1 ), can be represented. In other words, there is a number called aQ31 number, 
which is a number with 31 fractional bits. All operations assume that the binary 
point is fixed at this location. The fixed-point system, although simple to imple- 
ment in hardware, imposes limitations in the dynamic range of the represented 
number, which causes scaling problems in many applications. You can avoid 
this difficulty by using floating-point numbers. 

A floating-point number consists of a mantissa m multiplied by base b raised 
to an exponent e: 

m*b e 

In current hardware implementations, the mantissa is typically a normalized 
number with an absolute value between 1 and 2, and the base is b = 2. Al- 
though the mantissa is represented as a fixed-point number, the actual value 
of the overall number floats the binary point because of the multiplication by 
b e . The exponent e is an integer whose value determines the position of the 
binary point in the number. IEEE has established a standard format for the re- 
presentation of floating-point numbers. 

In order to achieve higher efficiency in the hardware implementation, the 
TMS320C3x uses a floating-point format that differs from the IEEE standard. 
This section describes briefly the two formats and presents software routines 
to convert between them. 

TMS320C3x floating-point format: 

8 1 23 



e 


s 


f 



In a 32-bit word representing a floating-point number, the first 8 bits correspond 
to the exponent expressed in twos-complement format. There is one bit for sign 
and 23 bits for the mantissa. The mantissa is expressed in twos-complement 
form with the binary point after the most significant non-sign bit. Since this bit 
is the complement of the sign bit s, it is suppressed. In other words, the mantis- 
sa actually has 24 bits. One special case occurs when e = -1 28. In this case, 
the number is interpreted as zero, independently of the values of s and f (which 
are by default set to zero). To summarize, the values of the represented num- 
bers in the TMS320C3x floating-point format are as follows: 



2e*(01.f) 


if s = 


2©*(10.f) 


if s = 1 





if e = -128 
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IEEE floating-point format: 
1 8 23 



s 


e 


f 



The IEEE floating-point format uses sign-magnitude notation for the mantissa, 
and offset by 127 for the exponent. In a 32-bit word representing a 
floating-point number, the first bit is the sign bit. The next 8 bits correspond to 
the exponent, expressed in an offset-by-127 format (the actual exponent is 
e -1 27) . The following 23 bits represent the absolute value of the mantissa with 
the most significant 1 implied. The binary point is after this most significant 1 . 
In other words, the mantissa actually has 24 bits. There are several special 
cases, summarized below. 

These are values of the represented numbers in the IEEE floating-point format: 

(_1)s* 2 e-127 * (o-i .f) if < e < 255 

Special cases: 

(-1) s *0.0 if e = and f = (zero) 

(_1)s * 2 -126 * ( .f) if e = and f <> (denormalized) 

(-1 ) s * infinity if e = 255 and f = (infinity) 

NaN (not a number if e = 255 and f <> 0) 

Based on these definitions of the formats, two versions of the conversion rou- 
tines were developed. One version handles the complete definition of the for- 
mats. The other ignores some of the special cases (typically the ones that are 
very rarely used), but it has the benefit that it executes faster than the complete 
conversion. For this discussion, they are referred to as the complete version 
and the fast version. 
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11.3.7.1 IEEE to TMS320C3x Floating-Point Format Conversion 

Example 1 1-22 shows the fast conversion from IEEE to TMS320C3X floating- 
point format 1 . It properly handles the general case when < e < 255, and also 
handles zeros (i.e., e = Oandf = 0). The other special cases (denormalized, in- 
finity, and NaN) are not treated and, if present, will give erroneous results. 

Example 11-22. IEEE toTMS320C3x Conversion (Fast Version) 

* TITLE IEEE TO TMS320C3:-: CONVERSION (FAST VERSION) 



SUBROUTINE FMIEEE 

FUNCTION: CONVERSION BETWEEN THE IEEE FORMAT AND THE 

TMS320C3X FLOATING POINT NUMBERS. THE NUMBER TO 
BE CONVERTED IS IN THE LOWER 32 BITS OF R0 . 
THE RESULT IS STORED IN THE UPPER 32 BITS OF R0 . 
UPON ENTERING THE ROUTINE, AR1 POINTS TO THE 
FOLLOWING TABLE: 

(0) 0xFF800000 < AR1 

(1) OxFFOOOOOO 

(2) 0x7F000000 

(3) 0x80000000 

(4) 0x81000000 

ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 

+ 

R0 | NUMBER TO BE CONVERTED 

AR1 | POINTER TO TABLE WITH CONSTANTS 

REGISTERS USED AS INPUT: R0, AR1 
REGISTERS MODIFIED: R0, Rl 
REGISTER CONTAINING RESULT: R0 

NOTE: SINCE THE STACK POINTER SP IS USED, MAKE SURE TO 
INITIALIZE IT IN THE CALLING PROGRAM. 



CYCLES: 12 (WORST CASE) WORDS: 12 
.global FMIEEE 

(Example continues on next page) 



1 The fast version of the IEEE-to-TMS320C3x conversion routine was originally 
developed by Keith Henry of Apollo Computer, Inc. The other routines were 
based on this initial input. 
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(Example continued from previous page) 



FMIEEE 


AND 3 


R0,*AR1,R1 




BND 


NEG 




ADD I 


RO , Rl 




LDIZ 


*+ARl(l),Rl ; 




SUBI 


*+ARl(2),Rl 




PUSH 


Rl 




POPF 


RO 


* 


RETS 




NEG 


PUSH 


Rl 




POPF 


RO 




NEGF 


RO , RO 




RETS 





Replace fraction with 

Test sign 

Shift sign and exponent inserting 

If all zero, generate C30 zero 

Unbias exponent 

Load this as a fit. pt . number 



Load this as a fit. pt . number 
Negate if original sign negative 
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Example 11-23 shows the complete conversion between IEEE and 
TMS320C3x formats. In addition to the general case and the zeros, it handles 
the special cases as follows: 

□ If NaN (e = 255, f< >0), the number is returned intact. 

Qi If infinity (e = 255, f = 0), the output is saturated to the most positive or neg- 
ative number, respectively. 

Qi If denormalized (e = 0, f< >0), two cases are considered. If the MSB of f 
is 1 , the number is converted to TMS320C3x format. Otherwise, an under- 
flow occurs and the number is set to zero. 

Example 11-23. IEEE to TMS320C3x Conversion (Complete Version) 

* TITLE IEEE TO TMS320C3x CONVERSION (COMPLETE VERSION) 

* SUBROUTINE FMIEEE1 
* 

* FUNCTION: CONVERSION BETWEEN THE IEEE FORMAT AND THE TMS320C3x 

* FLOATING POINT NUMBERS. THE NUMBER TO BE CONVERTED 

* IS IN THE LOWER 32 BITS OF R0 . THE RESULT IS STORED 

* IN THE UPPER 32 BITS OF R0 . 
* 

* 

* UPON ENTERING THE ROUTINE, AR1 POINTS TO THE FOLLOWING TABLE: 
* 

* (0) 0xFF800000 <--ARl 

* (1) OxFFOOOOOO 

* (2) 0x7F000000 

* (3) 0x80000000 

* (4) 0x81000000 

* (5) 0>:7F800000 

* (6) 0x0040.0000 

* (7) 0x007FFFFF 

* (8) 0x7F7FFFFF 
* 

* ARGUMENT ASSIGNMENTS: 

* ARGUMENT | FUNCTION 

* + 

* R0 | NUMBER TO BE CONVERTED 

* AR1 | POINTER TO TABLE WITH CONSTANTS 

* REGISTERS USED AS INPUT: R0, AR'l 

* REGISTERS MODIFIED: R0, Rl 

* REGISTER CONTAINING RESULT: R0 
* 

(Example continues on next page) 
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(Example continued from previous page) 

NOTE: SINCE THE STACK POINTER SP IS USED, MAKE SURE TO INITIALIZE 
IT IN THE CALLING PROGRAM. 



CYCLES: 23 (WORST CASE) 
.global FMIEEE1 



FMIEEE1 


LDI 


R0,R1 




AND 


*+ARl (5) ,R1 


* 


BZ 


UNNORM 




XOR 


*+ARl (5) ,R1 




BNZ 


NORMAL 



HANDLE NaN AND INFINITY 



TSTB 
RETSNZ 
LDI 
LDFGT 

LDFN 
RETS 



*+ARl (7) ,R0 

R0,R0 
*+ARl (8) ,R0 

*+ARl (5) ,R0 



WORDS: 34 



If e = 0, number is either or 
unnormalized 

If e < 255, use regular routine 



Return if NaN 

If positive, infinity= 
most positive number 

If negative, infinity^ 
most negative number RETS 



* HANDLE ZEROS AND UNNORMALIZED NUMBERS 
UNNORM 



TSTB 


*+ARl (6) 


, R0 


LDFZ 


*+ARl (3) 


, R0 


RETSZ 






XOR 


*+ARl (6) 


,R0 


BND 


NEG1 




LSH 


1,R0 




SUBI 


*+ARl (2) 


, R0 


PUSH 


R0 




POPF 


R0 




RETS 






POPF 


R0 




NEGF 


R0,R0 




RETS 







NEG1 



* HANDLE THE REGULAR CASES 
NORMAL 



NEG 



AND 3 


R0,*AR1,R1 


BND 


NEG 


ADD I 


R0 , Rl 


SUBI 


*+ARl (2) ,R1 


PUSH 


Rl 


POPF 


R0 


RETS 




POPF 


R0 


NEGF 


R0,R0 


RETS 





Is the msb of f equal to 1? 

If not, force the number to zero 

and return 
If (msb of f) = 1, make it 

Eliminate sign bit and line up mantissa 
Make e = -127 

Put number in floating point format 
If negative, negate R0 



Replace fraction with 

Test sign 

Shift sign and exponent inserting 

Unbias exponent 

Load this as a fit. pt . number 



Load this as a fit. pt . number 
Negate if original sign negative 
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11.3.7.2 TMS320C3x to IEEE Floating-Point Format Conversion 



The vast majority of the numbers represented by the TMS320C3x 
floating-point format are covered by the general IEEE format and the represen- 
tation of zeros. The only special case to consider is when e = -127 in the 
TMS320C3x format; this corresponds to a denormalized number in IEEE for- 
mat. It is ignored in the fast version, while it is treated properly in the complete 
version. Example 11-24 shows the fast, and Example 11-25, the complete 
version of the TMS320C3x-to-IEEE conversion. 
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Example 11-24. TMS320C3xto IEEE Conversion (Fast Version) 

* 

* TITLE TMS320C3x TO IEEE CONVERSION (FAST VERSION) 



SUBROUTINE TOIEEE 

FUNCTION: CONVERSION BETWEEN THE TMS320C3x FORMAT AND THE IEEE 
FLOATING POINT NUMBERS . THE NUMBER TO BE CONVERTED 
IS IN THE UPPER 32 BITS OF RO . THE RESULT WILL BE IN 
THE LOWER 32 BITS OF RO . 

UPON ENTERING THE ROUTINE, AR1 POINTS TO THE FOLLOWING TABLE: 



(0) 


0xFF800000 


(1) 


OxFFOOOOOO 


(2) 


0x7F000000 


(3) 


0x80000000 


(4) 


0x81000000 



AR1 



ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 



R0 | NUMBER TO BE CONVERTED 

AR1 | POINTER TO TABLE WITH CONSTANTS 

REGISTERS USED AS INPUT: R0, AR1 
REGISTERS MODIFIED: R0 
REGISTER CONTAINING RESULT: R0 

NOTE: SINCE THE STACK POINTER *SP' IS USED, MAKE SURE TO 
INITIALIZE IT IN THE CALLING PROGRAM. 



CYCLES: 14 (WORST CASE) 



WORDS: 15 



.global TOIEEE 



TOIEEE 



NEG 



LD 


R0,R0 




LDFZ 


*+ARl(4) 


,R0 ; 


BND 


NEG 




ABSF 


R0 




LSH 


1,R0 




PUSHF 


R0 




POP 


RO 




ADD I 


*+ARl(2) 


/RO 


LSH 


-1,R0 




RETS 






POP 


RO 




ADD I 


*+ARl(2) 


,R0 


LSH 


-1,R0 




ADD I 


*+ARl(3) 


/RO 


RETS 







Determine the sign of the number 
If zero, load appropriate number 
Branch to NEG if negative (delayed) 
Take the absolute value of the number 
Eliminate the sign bit in R0 

Place number in lower 32 bits of R0 
Add exponent bias (127) 
Add the positive sign 



Place number in lower 32 bits of R0 
Add exponent bias (127) 
Make space for the sign 
Add the negative sign 



11-45 



Logical and Arithmetic Operations 



Example 11-25. TMS320C3x to IEEE Conversion (Complete Version) 

* 

* TITLE TMS320C3x TO IEEE CONVERSION (COMPLETE VERSION) 



SUBROUTINE TOIEEE1 



FUNCTION: CONVERSION BETWEEN THE TMS320C3:-: FORMAT AND THE IEEE 
FLOATING POINT NUMBERS. THE NUMBER TO BE CONVERTED 
IS IN THE UPPER 32 BITS OF RO . THE RESULT WILL BE 
IN THE LOWER 32 BITS OF RO . 



UPON ENTERING THE ROUTINE, AR1 POINTS TO THE FOLLOWING TABLE: 



(0) 
(1) 
(2) 
(3) 
(4) 
(5) 
(6) 
(7) 
(8) 



0xFF800000 < 

OxFFOOOOOO 

0x7F000000 

0x80000000 

0x81000000 

0x7F800000 

0x00400000 

0x007FFFFF 

0x7F7FFFFF 



-- AR1 



ARGUMENT ASSIGNMENTS: 
ARGUMENT! FUNCTION 



R0 
AR1 



NUMBER TO BE CONVERTED 

POINTER TO TABLE WITH CONSTANTS 



REGISTERS USED AS INPUT: R0, AR1 
REGISTERS MODIFIED: R0 
REGISTER CONTAINING RESULT: R0 



■NOTE : 



SINCE THE STACK POINTER 'SP' IS USED, MAKE SURE TO 
INITIALIZE IT IN THE CALLING PROGRAM. 



* CYCLES: 
* 


31 


(WORST CASE) 


.global 
* 


TOIEEE1 


TOIEEE1 LDF 




R0,R0 


LDFZ 




*+ARl (4) ,R0 


BND 




NEG 


ABSF 




R0 


LSH 




1,R0 


PUSHF 




R0 


POP 




R0 


ADD I 




*+ARl (2) ,R0 


LSH 




-1,R0 



WORDS: 25 



Determine the sign- of the number 
If zero, load appropriate number 
Branch to NEG if negative (delayed) 
Take the absolute value of the number 
Eliminate the sign bit in R0 

Place number in lower 32 bits of R0 
Add exponent bias (127) 
Add the positive sign 



(Example continues on next page) 
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(Example continued from previous page) 



CONT 



NEG 



TSTB 


*+ARl'(5) 


, RO 


RETSNZ 






TSTB 


*+ARl (7) 


,R0 


RETSZ 






PUSH 


RO 




POPF 


RO 




LSH 


-1,R0 




PUSHF 


RO 




POP 


RO 




ADD I 


*+ARl (6) 


,R0 


RETS 






POP 


RO 




BRD 


CONT 




ADD I 


*+ARI (2) 


,R0 


LSH 


-1,R0 




ADD I 


*+ARl (3) 


, RO 



If E>0, return 

If E=0 & F=0, return 

Move F right by one bit 

Add to F a msb of 1 

Place number in lower 32 bits of RO 

Add exponent bias (127) 
Make space for the sign 
Add the negative sign 
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11.4 Application-Oriented Operations 



Certain features of the TMS320C3x architecture and instruction set facilitate 
the solution of numerically intensive problems. This section presents examples 
of applications using these features, such as companding, filtering, Fast Fouri- 
er Transforms (FFT), and matrix arithmetic. 



11.4.1 Companding 



In the area of telecommunications, one of the primary concerns is to conserve 
the channel bandwidth, and, at the same time, preserve high speech quality. 
This is achieved by quantizing the speech samples logarithmically. It has been 
demonstrated that an 8-bit logarithmic quantizer produces speech quality 
equivalent to a 13-bit uniform quantizer. The logarithmic quantization is 
achieved by companding (COMpress/exPANDing). Two international stan- 
dards have been established for companding: the ja-law (used in the United 
States and Japan), and the A-law (used in Europe). Detailed descriptions of 
u-law and A-law companding are presented in an application report on com- 
panding routines included in the book Digital Signal Processing Applications 
with the TMS320 Family (literature number SPRA01 2A). 

During transmission, logarithmically compressed data in sign-magnitude form 
are transmitted along the communications channel. If any processing is neces- 
sary, these data should be expanded to a 1 4-bit (for u-law) or 1 3-bit (for A-law) 
linear format. This operation is done upon receiving the data at the digital signal 
processor. After processing, and in order to continue transmission, the result 
is compressed back to 8-bit format and transmitted through the channel. 

Example 11 -26 and Example 11 -27 show ja-law compression and expansion 
(i.e.Jinearto |a-law and (i-lawto linear conversion), while Example 11-28 and 
Example 11-29 show A-law compression and expansion. For expansion, us- 
ing a look-up table is an alternative approach. It trades memory space for 
speed of execution. Since the compressed data is 8-bits long, a table with 256 
entries can be constructed containing the expanded data. If the compressed 
data is stored in the register ARO, the following two instructions will put the ex- 
panded data in register RO: 

ADDI @TABL,AR0 ; @TABL = BASE ADDRESS OF TABLE 

LDI *AR0,R0 ; PUT EXPANDED NUMBER IN RO 

The same look-up table approach could be used for compression, but the re- 
quired table length would then be 16,384 words for n-law or 8,192 words for 
A-law. If this memory size is not acceptable, the subroutines presented in 
Example 11-26 or Example 11-28 should be used. 
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Example 11-26. \i-Law Compression 



*TITLE U-LAW COMPRESSION 



SUBROUTINE MUCMPR 



ARGUMENT ASSIGNMENTS 
ARGUMENT! FUNCTION 



RO 



NUMBER TO BE CONVERTED 



REGISTERS USED AS INPUT: RO 
REGISTERS MODIFIED: RO, Rl, R2, SP 
REGISTER CONTAINING RESULT: RO 

NOTE: SINCE THE STACK POINTER 'SP' IS USED IN THE COMPRESSION 
ROUTINE ■* MUCMPR ' , MAKE SURE TO INITIALIZE IT IN THE 
THE CALLING PROGRAM. 



* CYCLES: 20 
* 


WORDS : 


17 


* 

.global 


MUCMPR 




MUCMPR LDI 


R0,R1 




ABSI 


R0,R0 




CMP I 


1FDEH,R0 


, 


LDIGT 


1FDEH,R0 


; 


ADD I 


33, RO 


/ 


FLOAT 


RO 


/ 


MPYF 


0. 03125, RO 


; 


LSH 


1,R0 


; 


PUSHF 


RO 




POP 


RO 


; 


LSH 


-20, RO 


'' 


LDI 


0,R2 




LDI 


R1,R1 


; 


LDILT 


80H,R2 


/ 


ADD I 


R2,R0 


/ 


NOT 


RO 


; 


RETS 







Save sign of number 

If R0>0>:1FDE, 
saturate the result 
Add bias 

Normalize: (seg+5) OWXYZx. . .x 
Adjust segment number by 2** (—5) 
(seg)WXYZx. . .x 

Treat number as integer 
Right— justify 



If number is negative, 

set sign bit 
RO = compressed number 
Reverse all bits for transmission 
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Example 11-27. \i-Law Expansion 



* TITLE *U-LAW EXPANSION' 



* SUBROUTINE MUXPND 



ARGUMENT ASSIGNMENTS: 



ARGUMENT | FUNCTION 

+ 

RO | NUMBER TO BE CONVERTED 

REGISTERS USED AS INPUT: RO 
REGISTERS MODIFIED: RO, Rl, R2, SP 
REGISTER CONTAINING RESULT: RO 



CYCLES: 20 (WORST CASE) 



WORDS: 14 



global MUXPND 



MUXPND NOT 


R0,R0 


LDI 


R0,R1 


AND 


0FH,R1 


LSH 


1,R1 


ADD I 


33, Rl 


LDI 


R0,R2 


LSH 


-4,R0 


AND 


7,R0 


LSH3 


R0,R1,R0 


SUBI 


33, RO 


TSTB 


80H,R2 


RETSZ 




NEGI 


RO 


RETS 





Complement bits 

Isolate quantization bin 

Add bias to introduce lxxxxl 
Store for sign bit 

Isolate segment code 
Shift and put result in RO 
Subtract bias 
Test sign bit 

Negate if a negative number 
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Example 11-28. A-Law Compression 



TITLE A-LAW COMPRESSION 



SUBROUTINE ACMPR 



ARGUMENT ASSIGNMENTS: 
ARGUMENT! FUNCTION 



RO 



| NUMBER TO BE CONVERTED 



REGISTERS USED AS INPUT: RO 
REGISTERS MODIFIED: RO, Rl, R2, SP 
REGISTER CONTAINING RESULT: RO 



NOTE: SINCE THE STACK POINTER A SP' IS USED IN THE COMPRESSION 



ROUTINE A ACMPR' , 
CALLING PROGRAM. 



MAKE SURE TO INITIALIZE IT IN THE 



CYCLES: 22 



WORDS: 19 



.global ACMPR 



ACMPR 



END 



LDI 


RO , Rl 


ABSI 


R0,R0 


CMP I 


1FH,R0 


BLED 


END 


CMP I 


0FFFH,R0 


LDIGT 


0FFFH,R0 


LSH 


-1 , RO 


FLOAT 


RO 


MPYF 


0.125,R0 


LSH 


1 , RO 


PUSHF 


RO 


POP 


RO 


LSH 


-20, RO 


LDI 


0,R2 


LDI 


Rl , Rl 


LDILT 


80H,R2 


ADD I 


R2 , RO 


XOR 


0D5H,R0 


RETS 





Save sign of number 

If R0<0::20, 

Do linear coding 

If R0>0xFFF, 

saturate the result 
Eliminate rightmost bit 

Normalize: (seg+3) OWXYZx . . .x 
Adjust segment number by 2** (—3) 
(seg)WXYZx. . .x 

Treat number as integer 
Right— justify 



If number is negative, 

set sign bit 
RO = compressed number 
Invert even bits for transmission 
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Example 11-29. A-Law Expansion 



TITLE A-LAW EXPANSION 

SUBROUTINE AXPND 

ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 



RO 



NUMBER TO BE CONVERTED 



REGISTERS USED AS INPUT: RO 
REGISTERS MODIFIED: RO, Rl, R2, SP 
REGISTER CONTAINING RESULT: RO 



CYCLES: 25 (WORST CASE) 



WORDS: 16 



AXPND 



SKIP1 



.global 


AXPND 


XOR 


D5H,R0 


LDI 


R0,R1 


AND 


0FH,R1 


LSH 


1,R1 


LDI 


R0,R2 


LSH 


-4,R0 


AND 


7,R0 


BZ 


SKIP1 


SUBI 


1,R0 


ADD I 


32,R1 


ADD I 


1,R1 


LSH3 


R0,R1,R0 


TSTB 


80H,R2 


RETSZ 




NEGI 


RO 


RETS 





Invert even bits 
Isolate quantization bin 
Store for bit sign 
Isolate segment code 



Create 1 :•::•::■::•: 1 

OR Oxxxxl 

Shift and put result in RO 

Test sign bit 

Negate if a negative number 



11.4.2 FIR, IIR, and Adaptive Filters 



Digital filters are a common requirement for digital signal processing systems. 
There are two types of digital filters: Finite Impulse Response (FIR) and Infinite 
Impulse Response (IIR). Each of these types can have either fixed or adapt- 
able coefficients. In this section, the fixed-coefficient filters are presented first, 
and then the adaptive filters are discussed. 
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If the FIR filter has an impulse response h [0], h [1 ],..., h [N -1], and x[n] repre- 
sents the input of the filter at time n, the output y [n] at time n is given by this 
equation: 

y [n] = h [0] x[n] + h [1] x [n-1] + ...+ h [N -1] x[n- (N -1)] 

Two features of the TMS320C3x that facilitate the implementation of the FIR 
filters are parallel multiply/add operations and circular addressing. The first 
one permits the performance of a multiplication and an addition in a single ma- 
chine cycle, while the second one makes a finite buffer of length N sufficient 
for the data x. 

Figure 1 1 -1 shows the arrangement of the memory locations in order to imple- 
ment circular addressing, while Example 1 1-30 presents the TMS320C3x as- 
sembly code for an FIR filter. 



Figure 11-1. Data Memory Organization for an FIR Filter 



Low 


Impulse 
Response 


Oldest Input 
Newest Input 


Initial 
Input Samples 




Final 
Input Samples 




Address 


h(N-1) 


x[n-(N-1)] 




x(n) 






h(N-2) 


x[n-(N-2)] 


x[n-(N-1)] 








• 
• 


• 
• 
• 




• 
• 
• 


Circular 
Queue 




h(1) 


x(n-1) 




x(n-2) 






High 


h(0) 


x(n) 


x(n-1) 





In order to set up circular addressing, initialize the block-size register BK to 
block length N. Also, the locations for signal x should start from a memory loca- 
tion whose address is a multiple of the smallest power of 2 that is greater than 
N. For instance, if N = 24, the first address for x should be a multiple of 32 (the 
lower 5 bits of the beginning address should be zero). To understand this re- 
quirement, look at section 5.3 on page 5-24, the section describing circular ad- 
dressing. 

In Example 1 1-30, the pointer to the input sequence x is incremented and as- 
sumed to be moving from an older input to a newer input. At the end of the sub- 
routine, AR1 will be pointing to the position for the next input sample. 
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Example 11-30. FIR Filter 

* 

* TITLE FIR FILTER 
'* SUBROUTINE FIR 



EQUATION: y (n) = h(0) * x(n) + h(l) * x(n-l) + 
. . . + h(N-l) * x(n-(N-l) ) 

TYPICAL CALLING SEQUENCE: 



LOAD 


ARO 


LOAD 


AR1 


LOAD 


RC 


LOAD 


BK 


CALL 


FIR 



ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 



ARO | ADDRESS OF h(N-l) 

AR1 | ADDRESS OF x(n-(N-l)) 

RC | LENGTH OF FILTER - 2 (N-2) 

BK | LENGTH OF FILTER (N) 

REGISTERS USED AS INPUT: ARO, AR1, RC, BK 
REGISTERS MODIFIED: RO, R2, ARO, AR1, RC 
REGISTER CONTAINING RESULT: RO 



* CYCLES: 11 + (N-l) WORDS: 6 
* 

.global FIR 

* ; Initialize RO : 
FIR MPYF3 *AR0++(1) ,*AR1++(1) %,R0 

* ; h(N-l) * x(n-(N-l)) -> RO 
LDF 0.0, R2 ; Initialize R2 . 

* FILTER (1 <= i < N) 

RPTS RC ; Setup the repeat cycle. 

MPYF3 *AR0++(1) ,*AR1++(1) %,R0 ; h (N-l-i) *x (n- (N-l-i) ) ->R0 

| | ADDF3 R0,R2,R2 ; Multiply and add operation 

* 

ADDF R0,R2,R0 ; Add last product 
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RETURN SEQUENCE 

RETS 
end 
, end 



11.4.2.2 IIR Filters 



Return 



The transfer function of the IIR filters has both poles and zeros. Its output de- 
pends on both the input and the past output. As a rule, the filters need less com- 
putation than an FIR with similar frequency response, but the filters have the 
drawback of being sensitive to coefficient quantization. Most often, the IIR fil- 
ters are implemented as a cascade of second-order sections, called biquads. 
Example 11-31 and Example 11 -32 show the implementation for one biquad 
and for any number of biquads, respectively. 

This is the equation for a single biquad: 

y[n] = a1 y[r?-1] + a2 y[n-2] + bO x[n] + M x[n-1] + b2x[n-2] 

However, the following two equations are more convenient and have smaller 
storage requirements: 

d[n] = a2d[n-2] + s\ d[n-1]+ x[n] 
y[n) = b2d[n-2] + b\ d [n-^+bO d[n] 

Figure 11-2 shows the memory organization for this two-equation approach, 
and Example 11-31 is an implementation of a single biquad on the 
TMS320C3X. 



Figure 11-2. Data Memory Organization for a Single Biquad 
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As in the case of FIR filters, the address for the start of the values d must be 
a multiple of 4; i.e., the last two bits of the beginning address must be zero. The 
block-size register BK must be initialized to 3. 
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Example 11-31. IIR Filter (One Biquad) 



TITLE IIR filter 

SUBROUTINE I I R 1 

IIR1 == IIR FILTER (ONE BIQUAD) 



EQUATIONS :d(n) = a2 * d(n-2) + al * d(n-l) + x (n) 

y(n) = b2 * d(n-2) + bl * d(n-l) + bO * d(n) 

OR y(n) = al*y(n-l) + a2*y(n-2) + bO*x(n) 
+ bl*x(n-l) + b2*x(n-2) 

TYPICAL CALLING SEQUENCE: 



load 


R2 


load 


ARO 


load 


AR1 


load 


BK 


CALL 


IIR1 



* ARGUMENT ASSIGNMENTS: 

* ARGUMENT | FUNCTION 

* + 

* R2 | INPUT SAMPLE X(N) 

* ARO | ADDRESS OF FILTER COEFFICIENTS (A2) 

* AR1 | ADDRESS OF DELAY MODE VALUES (D(N-2)) 

* BK | BK = 3 

* REGISTERS USED AS INPUT: R2, ARO, AR1, BK 

* REGISTERS MODIFIED: RO, Rl, R2, ARO, AR1 

* REGISTER CONTAINING RESULT: RO 
* 

* CYCLES: 11 WORDS: 8 
* 

* FILTER 

•global IIR1 
* 

IIR1 MPYF3 *AR0,*AR1,R0 

* ; a2 * d(n-2) -> RO 
MPYF3 *++AR0 (1) ,*AR1--(D % , Rl 

* ; b2 * d(n-2) -> Rl 
* 

MPYF3 *++AR0(l) , *AR1,R0 ; al * d(n-l) -> RO 

II ADDF3 R0,R2,R2 ; a2*d (n-2) +x (n) -> R2 
* 

MPYF3 *++AR0(l) ,*AR1--(1)%,R0 ; bl * d(n-l) -> RO 

II ADDF3 R0,R2,R2 , ; al*d (n-1) +a2*d (n-2) +x (n) -> R2 
* 

MPYF3 *++AR0(l) ,R2,R2 / bO * d(n) -> R2 
|| STF R2,*AR1++(1)% 
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ADDF 


R0,R2 


addf' 


R1,R2,R0 


RETURN 


SEQUENCE 


RETS 





end 
. end 
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; Store d(n) and point to d(n— 1) . 

/ bl*d(n-l)+bO*d(n) -> R2 

; b2*d(n-2)+bl*d(n-l)+b0*d(n) -> RO 



Return 



In the more general case, the MR filter contains N >1 biquads. The equations 
for its implementation are given by the following pseudo-C language code: 

y[0,n] = x[n] 

for (7=0; /<N; /'++){ 

d[i,n] = a2 [ i] d[i, n-2] + a1 [ /] d[/,/?-1] + y[/-1,/7] 
y[i,n] = t£[i]d[i-2] + b\ [i] d[/,/7-1] + bO [ i] d[i,n] 

} 

y[n] = y[N -1,n] 

Figure 11-3 shows the corresponding memory organization, while 
Example 1 1 -32 shows the TMS320C3x assembly-language code. 
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Figure 11-3. Data Memory Organization for N Biquads 
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The block register BK should be initialized to 3, and the beginning of each set 
of dvalues (i.e., d [i,n ], / = 0...N -1 ) should be at an address that is a multiple 
of 4 (the last two bits zero), as stated in the case of a single biquad. 
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Example 11-32. I IR Filters (N> 1 Biquads) 



TITLE IIR FILTERS (N > BIQUADS) 



SUBROUTINE IIR2 



EQUATIONS: y(0,n) = x (n) 



FOR (i = 0; 
{ 
d(i,n) = 


i < N; 


i++) 








= a2(i) 


* d(i,n- 


-2) 


+ 


al(i) 


y(i,n) = 


= b2(i) 


* d(i,n- 


-2) 


+ 


bl(i) 


TYPICAL 
} 
y(n) = y(N- 


CALLING 


SEQUENCE: 






-l,n) 










TYPICAL CALLING SEQUENCE: 








load 


R2 










load 


ARO 










load 


AR1 










load 


IRO 










load 


IR1 










load 


BK 










load 


RC 










CALL 


IIR2 











d(i,n-l) * y(i-l,n) 
d(i,n-l) * b0(i) * ,d(i,n) 



ARGUMENT ASSIGNMENT: 
ARGUMENT! FUNCTION 



R2 

ARO 

AR1 

BK 

IRO 

IR1 

RC 



INPUT SAMPLE x(n) 

ADDRESS OF FILTER COEFFICIENTS (a2(0)) 

ADDRESS OF DELAY NODE VALUES (d(0,n-2)) 

BK = 3 

IRO = 4 

IR1 = 4*N-4 

NUMBER OF BIQUADS (N) -2 



REGISTERS USED AS INPUT; R2, ARO, AR1, IRO, IR1, BK, RC 
REGISTERS MODIFIED; R0, Rl, R2, ARO, AR1, RC 
REGISTERS CONTAINING RESULT: R0 



CYCLES: 17+6 (N-l) 



WORDS: 17 



.global IIR2 
* 

IIR2 MPYF3 *AR0, *AR1, R0 

* : a2(0) * d(0,n-2) -> R0 

MPYF3 *AR1++(1), *AR1--(1) ',, Rl 
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; b2 (0) * d(0,n-2) -> Rl 



MPYF3 *++AR0(l) , *AR1,R0 ; al (0) * D(0,n-1) -> R0 
ADDF RO, R2, R2 ; First sum term of d(0,n) . 

MPYF3 *++AR0(l) ,*AR1--(D%,R0 ;bl(0) * d(0,n-l) -> RO 
ADDF3 RO, R2, R2 ; Second sum term of d(0,n) . 
MPYF3 *++AR0(l) ,R2,R2 ;b0(0) * d(0,n) -> R2 

STF . R2, *AR1--(D% 



I I 



RPTB 

MPYF3 
ADDF 3 

MPYF3 
ADDF 3 



LOOP 



; Store d(0,n) ; Point to d(0,n-2) 
; Loop for 1 <= i < n 



*++AR0(l) ,*++ARl (IRO) ,R0 ;a2(i) * d(i,n-2) -> RO 
R0,R2,R2 ; First sum term of y(i— l,n) . 

*++AR0(l) ,*AR1-- (1)%R1 ; b2 (i) * D(i,n-2) -> Rl 
R1,R2,R2 ; Second sum term of y(i— l,n) . 



MPYF3 *++AR0(l) ,*AR1,R0 ;al(i) * d(i,n-l) -> RO 

M ADDF3 R0,R2,R2 ; First sum of d(i,n). 

MPYF3 *++AR0(l) ,*AR1--(D%,R0 ;bl(i) * d(i,n-l) -> RO 

II ADDF3 R0,R2,R2 ; Second sum term of d(i,n) . 

STF R2, *AR1--(D% 

* ; Store d(i,n) ; point to d(i,n— 2) 
LOOP MPYF3 *++AR0(l), R2,R2 

* ; b0(i) * d(i,n) -> R2 



FINAL SUMMATION 



ADDF 
ADDF 3 

NOP 
NOP 



R0,R2 
Rl , R2 , RO 

*AR1--(IR1) 
*AR1--(D% 



RETURN SEQUENCE 
RETS 



end 



First sum term of y(n— l,n) 
Second sum term of y (n— l,n) 

Return to first biquad 
Point to d(0,n-l) 



Return 



.end 
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11.4.2.3 Adaptive Filters (LMS Algorithm) 



In some applications in digital signal processing, a filter must be adapted over 
time to keep track of changing conditions. The book Theory and Design of 
Adaptive Filters by Treichler, Johnson, and Larimore (Wiley-lnterscience, 
1 987) presents the theory of adaptive filters. Although in theory, both FIR and 
IIR structures can be used as adaptive filters, the stability problems and the 
local optimum points that the IIR filters exhibit make them less attractive for 
such an application. Hence, until further research makes IIR filters a better 
choice, only the FIR filters are used in adaptive algorithms of practical applica- 
tions. 

In an adaptive FIR filter, the filtering equation takes this form: 

y[n] = h[n,0]x[n] + h [n,1]x [n -1] +...+ h[n,H -1]x[n- (N -1)] 

The filter coefficients are time-dependent. In a least-mean-squares (LMS) al- 
gorithm, the coefficients are updated by an equation in this form: 

h[n+ J l,\] = h[n, J l] + .$x[n-i], /'=0,1 N-1 

p is a constant for the computation. The updating of the filter coefficients can 
be interleaved with the computation of the filter output so that it takes 3 cycles 
per filter tap to do both. The updated coefficients are written over the old filter 
coefficients. Example 11-33 shows the implementation of an adaptive FIR fil- 
ter on the TMS320C3x. The memory organization and the positioning of the 
data in memory should follow the same rules as the above FIR filter with fixed 
coefficients. 
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Example 1 1-33. Adaptive FIR Filter (LMS Algorithm) 

* TITL ADAPTIVE FIR FILTER (LMS ALGORITHM) 



SUBROUTINE LMS 

LMS == LMS ADAPTIVE FILTER 



* EQUATIONS: y(n) = h(n,0)*x(n) + h (n, 1) *:-: (h-1) + ... 

* + h(n,N-l)*x(n-(N-l) ) 
■* FOR (i' = 0; i < N; i++) 

* h(n+l,i) = h(n,i) + tmuerr * x(n-i) 
* 

* TYPICAL CALLING SEQUENCE: 



load 


R4 


load 


ARO 


load 


AR1 


load 


RC 


load 


BK 


CALL 


FIR 



* ARGUMENT ASSIGNMENTS: 

* ARGUMENT | FUNCTION 



R4 

ARO 

AR1 

RC 

BK 



SCALE FACTOR (2 * mu * err) 
ADDRESS OF h(n,N-l) 
ADDRESS OF x(n-(N-l)) 
LENGTH OF FILTER - 2 (N-2) 
LENGTH OF FILTER (N) 



* REGISTERS USED AS INPUT: R4, ARO, AR1, RC, BK 

* REGISTERS MODIFIED: R0, Rl, R2, ARO, AR1, RC 

* REGISTER CONTAINING RESULT: R0 

* PROGRAM SIZE: 10 words 

* EXECUTION CYCLES: 12 + 3(N-1) 

* SETUP (i = 0) 
* 

.global LMS 

* ; Initialize R0 : 
LMS MPYF3 *AR0, *AR1, R0 

* ; h(n,N-l) * x(n-(N-l)) -> R0 
LDF 0.0, R2 ; Initialize R2 . 

* 

* ; Initialize Rl : 
MPYF3 *AR1++(1)%, R4, Rl 

* ; x(n-(N-l)) * tmuerr -> Rl 
ADDF3 *AR0++(1), Rl, Rl 

* ; h(n,N-l) + x(n-(N-l)) * 
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* ; tmuerr — > Rl 

* FILTER AND UPDATE (1 <= I < N) 

RPTB LOOP ; Setup the repeat block. 

* ; Filter: 

MPYF3 *AR0--(1) ,AR1,R0 ; h(n,N-l-i) * :: (n- (N-l-i) ) ->R0 
II ADDF3 R0,R2,R2 ; Multiply and add operation. 

* ; UPDATE : 

MPYF3 *AR1++(1)%,R4,R1 ; :•: (n, N- (N-l-i) ) * tmuerr -> Rl 

II STF R1,*AR0++(1) ; Rl -> h (n+1, N-l- (i-1) ) 
* 

LOOP ADDF3 *AR0++(1), Rl, Rl 

* ; h(n,N-l-i) + :•: (n- (N-l-i) ) *tmuerr -> Rl 
* 

ADDF3 R0,R2,R0 ; Add last product. 

STF R1,*-AR0(1) ; h(n,0) + :-,:(n) * tmuerr -> h(n+l,0) 

* RETURN SEQUENCE 

RETS ; Return 

* end 

. end 
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11.4.3 Matrix- Vector Multiplication 



In matrix-vector multiplication, a K x N matrix of elements m(i,j) having K rows 
and N columns is multiplied by an N x T vector to produce a K x 1 result. The 
multiplier vector has elements v(j), and the product vector has elements p(i). 
Each one of the product-vector elements is computed by the following expres- 
sion: 

p(i) = m(i,0) i/(0) + m(/,1) v(1)+...+ m(/,N-1) i/(N-1)/= 0,1 K-1 

This is essentially a dot product, and the matrix-vector multiplication contains, 
as a special case, the dot product presented in Example 11-2 on page 11-8. 
In pseudo-C format, the computation of the matrix multiplication is expressed 
by 

for (/=0; i<K; /++){ 
p(/) = 
for (/= 0; j < N ;j+ +) 

p(/) = p(/) + m(/,/)*v(y) 

} 

Figure 1 1 -4 shows the data memory organization for matrix-vector multiplica- 
tion, and Example 1 1-34shows the TMS320C3x assembly code to implement 
it. Note that in Example 11-34, K (number of rows) should be greater than 0, 
and N (number of columns) should be greater than 1 . 



Figure 1 1-4. Data Memory Organization for Matrix- Vector Multiplication 
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Example 11-34. Matrix Times a Vector Multiplication 

* 

* TITL MATRIX TIMES A VECTOR MULTIPLICATION 

* 

* SUBROUTINE MAT 

* MAT == MATRIX TIMES A VECTOR OPERATION 

* TYPICAL CALLING SEQUENCE: 
* 

* load ARO 

* load AR1 

* load AR2 

* load AR3 

* load Rl 

* CALL MAT 

* ARGUMENT ASSIGNMENTS: 

* ARGUMENT | FUNCTION 



ARO 
AR1 
AR2 
AR3 
Rl 



ADDRESS OF M(0,0) 

ADDRESS OF V(0) 

ADDRESS OF P(0) 

NUMBER OF ROWS - 1 (K-l) 

NUMBER OF COLUMNS - 2 (N-2) 



REGISTERS USED AS INPUT: ARO, AR1, AR2, AR3, Rl 
REGISTERS MODIFIED: RO, R2, ARO, AR1, AR2 , AR3 , IRO, 
RC, RSA, REA 



PROGRAM SIZE: 11 

EXECUTION CYCLES: 6+10 *K+K* (N-l) 



.global MAT 

* SETUP 

MAT LDI R1,IR0 ; number of columns-2 -> IRO 

ADDI 2, IRO ; IRO = N 

* 

* FOR (i = 0; i < K; i++) . LOOP OVER THE ROWS. 
* 

ROWS LDF 0.0, R2 ; initialize R2 

MPYF3 *AR0++(1) ,*AR1++(1) , RO 

* ; m(i,0) * v(0) -> RO 

* FOR (j = 1; j < N; j++) DO DOT PRODUCT OVER COLUMNS 

RPTS Rl ; multiply a row by a column. 

MPYF3 *AR0++(1) , *AR1++(1) , RO / m(i,j) * v(j) -> RO 
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ADDF3 


R0,R2,R2 


DBD 


AR3 , ROWS 


ADDF 


R0,R2 


STF 


R2, *AR2++(1) 


NOP 


*-^ARl (IRO) 



m(i,j-l) * v(j-l) + R2 -> R2. 
counts the number of rows left. 

■ last accumulate. 

result — > p (i) 
• set AR1 to point to v(0) . 

* ! ! ! DELAYED BRANCH HAPPENS HERE ! ! ! 

* RETURN SEQUENCE 

RETS ; return 

* end 

.end 

11.4.4 Fast Fourier Transforms (FFT) 

Fourier transforms are an important tool often used in digital signal processing 
systems. The purpose of the transform is to convert information from the time 
domain to the frequency domain. The inverse Fourier transform converts infor- 
mation back to the time domain from the frequency domain. Implementation 
of Fourier transforms that are computationally efficient are known as Fast 
Fourier Transforms (FFTs). The theory of FFTs can be found in books such as 
DFT/FFT and Convolution Algorithms by C.S. Burrus and T.W. Parks (John 
Wiley, 1985), and in the book Digital Signal Processing Applications with the 
TMS320 Family. 

Certain TMS320C3x features that increase efficient implementation of numeri- 
cally intensive algorithms are particularly well-suited for FFTs. The high speed 
of the device (50-ns cycle time) makes the implementation of real-time algo- 
rithms easier, while the floating-point capability eliminates the problems asso- 
ciated with dynamic range. The powerful indexing scheme in indirect address- 
ing facilitates the access of FFT butterfly legs that have different spans. A con- 
struct that reduces the looping overhead in algorithms heavily dependent on 
loops (such as the FFTs) is the repeat block implemented by the RPTB instruc- 
tion. This construct gives the efficiency of in-line coding but has the form of a 
loop. Since the output of the FFT is in scrambled (bit-reversed) order when the 
input is in regular order, it must be restored to the proper order. This rearrange- 
ment does not require extra cycles. The device has a special form of indirect 
addressing (bit-reversed addressing mode) that can be used when the FFT 
output is needed. This mode permits accessing the FFT output in the proper 
order. 

Fast Fourier Transform is a label for a collection of algorithms that implement 
efficient conversion from time to frequency domain. There are several types 
of FFTs: 

Q Radix-2 and radix-4 algorithms (depending on the size of the FFT butterfly) 

□i Decimation in time or frequency (DIT or DIF) 
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Qi Complex or real FFTs 

Qi FFTs of different lengths, etc. 

The examples in this section of implementation of the FFTwere based on pro- 
grams contained in the Burrus and Parks book, and in the paper Real-Valued 
Fast Fourier Transform Algorithms by H.V. Sorensen, et al (IEEE Trans, on 
ASSP, June 1987). 

Example 11-35 and Example 11-36 show the implementation of a complex 
radix-2, DIF FFT on the TMS320C3x. Example 11-35 contains the generic 
code of the FFT that can be used with any length number. However, for the 
complete implementation of an FFT, a table of twiddle factors (sines/cosines) 
is needed, and this table depends on the size of the transform. To retain the 
generic form of Example 11-35, the table with the twiddle factors (containing 
1 -1 /4 complete cycles of a sine) is presented separately in Example 1 1 -36 for 
the case of a 64-point FFT. A full cycle of a sine should have a number of points 
equal to the FFT size. In Example 11-36, the FFT length N and M, which is 
equal to the logarithm of N to base equal to the radix, are defined. M is the num- 
ber of stages of the FFT. For a 64-point FFT, M = 6 when using a radix-2 algo- 
rithm or M = 3 when using a radix-4 algorithm. If the table with the twiddle fac- 
tors and the FFT code are kept in separate files, they should be connected at 
link time. 
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Example 11-35. Complex, Radix-2, DIFFFT 

* 

* TITL COMPLEX, RADIX-2, DIF FFT 
* 

* GENERIC PROGRAM FOR A LOOPED-CODE RADIX-2 FFT COMPUTATION IN TMS320C3x 
* 

* THE PROGRAM IS TAKEN FROM THE BURRUS AND PARKS BOOK, P. 111. 

* THE (COMPLEX) DATA RESIDE IN INTERNAL MEMORY. THE COMPUTATION 

* IS DONE IN-PLACE, BUT THE RESULT IS MOVED TO ANOTHER MEMORY 

* SECTION TO DEMONSTRATE THE BIT-REVERSED ADDRESSING. 
* 

* THE TWIDDLE FACTORS ARE SUPPLIED IN A TABLE PUT IN A .DATA SECTION. 

* THIS DATA IS INCLUDED IN A SEPARATE FILE TO PRESERVE THE GENERIC 

* NATURE OF THE PROGRAM. FOR THE SAME PURPOSE, THE SIZE OF THE FFT 

* N AND LOG2(N) ARE DEFINED IN A . GLOBL DIRECTIVE AND SPECIFIED 

* DURING LINKING. 





.globl 


FFT 




.globl 


N 




.globl 


M 




.globl 


SINE 


INP 


.usect 


"IN", 1024 




.BSS 


OUTP,1024 




• text 




* INITIALIZE 




FFTSIZ 


.word 


N 


LOGFFT 


.word 


M 


SINTAB 


.word 


SINE 


INPUT 


.word 


INP 


OUTPUT 


.word 


OUTP 


FFT: 


LDP 


FFTSIZ 




LDI 


@FFTSIZ,IR1 




LSH 


-2,IR1 




LDI 


0,AR6 




LDI 


@FFTSIZ,IR0 




LSH 


1,IR0 




LDI 


@FFTSIZ,R7 




LDI 


1,AR7 




LDI 


1 , AR5 


* OUTER LOOP 




LOOP : 


NOP 


*++AR6(l) 




LDI 


@ INPUT, ARO 




ADD I 


R7 , ARO , AR2 




LDI 


AR7 , RC 




SUBI 


1,RC 


* ■ FIRST LOOP 






RPTB 


BLK1 




ADDF 


*AR0,*AR2,R 




SUBF 


*AR2++,*AR0 



Entry point for execution 

FFT size 

LOG2 (N) 

Address of sine table 

Memory with input data 
Memory with output data 



; Command to load data page pointer 



; IRl=N/4, pointer for SIN/COS table 

/ AR6 holds the current stage number 

; IR0=2*N1 (because of real/imag) 

; R7=N2 

; Initialise repeat counter of first loop 

; Initialize IE index (AR5=IE) 



; Current FFT stage 

; ARO points to X(I) 

; AR2 points to X(L) 

; RC should be one less than desired # 



R0=X(I)+X(L) 

; R1=X(I)-X(L)' 
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ADDF *AR2 , *ARO , R2 

SUBF *AR2 , *ARO , R3 

STF R2,*AR0-- 

| | STF R3,*AR2-- 

BLK1 STF RO,*ARO++(IRO) 

I | STF R1,*AR2++(IR0) 

* IF THIS IS THE LAST STAGE, 

CMP I @L0GFFT,AR6 
BZD END 

* MAIN INNER LOOP 



LDI 
LDI 
INLOP: ADDI 
LDI 
ADDI 
ADDI 
ADDI 
LDI 
SUBI 
LDF 

* SECOND LOOP 

RPTB 

SUBF 

SUBF 
* 

MPYF 
I I ADDF 



MPYF 

STF 

SUBF 

MPYF 

ADDF 

MPYF 

STF 



2 , AR1 

@SINTAB,AR4 

AR5,AR4 

AR1,AR0' 

2,AR1 

@ INPUT, ARO 

R7 , ARO , AR2 

AR7 , RC 

1,RC 

*AR4,R6 



R2=Y(I)+Y(L) 

R3=Y(I)-Y(L) 

Y(I)=R2 and... 

Y(L)=R3 

X(I)=R0' and... 

X(L)=R1 and ARO, 2 = ARO, 2 + 2*n 

YOU ARE DONE 



Init loop counter for inner loop 
Initialize IA index (AR4=IA) 
IA=IA+IE; AR4 points to cosine 

Increment inner loop counter 
(X(I) ,Y(I) ) pointer 
(X(L) ,Y(L) ) pointer 

RC should be one less than desired # 
R6=SIN 



BLK2 

*AR2, *AR0,R2 

*+AR2,*+AR0,Rl 

R2,R6,R0 
*+AR2,*+AR0,R3 



R2=X(I)-X(L) 

R1=Y(I)-Y(L) 
R0=R2*SIN and, 



I I 



R1,*+AR4 (IR1) ,R3 

R3,*+AR0 

R0,R3,R4 

R1,R6,R0 

*AR2 , *AR0 , R3 

R2,*+AR4 (IR1) ,R3 

R3,*AR0++(IR0) 



R3=Y(I)+Y(L) 

; R3 = Rl * COS and 
Y(I)=Y(I)+Y(L) 
R4=R1*C0S-R2*SIN 
R0=R1*SIN and. . . 
R3=X(I)+X(L) 

; R3 = R2 * COS and. 



ADDF R0,R3,R5 
BLK2 STF R5,*AR2++(IR0) 
I | STF R4,*+AR2 



X(I)=X(I)+X(L) and AR0=AR0+2*Nl 
R5=R2*C0S+R1*SIN 

X(L)=R2*COS+Rl*SIN, incr AR2 and. 
Y (L) =Rl*COS-R2*SIN 



CMP I 
BNE 

LSH 
BRD 
LSH 

LDI 
LSH 



R7 , AR1 
INLOP 

1,AR7 

LOOP 

1,AR5 

R7,IR0 

-1,R7 



Loop back to the inner loop 

Increment , loop counter for next time 

Next FFT stage (delayed) 

IE=2*IE 

N1=N2 

N2=N2/2 



STORE RESULT OUT USING BIT-REVERSED ADDRESSING 



END: 



LDI 

SUBI 

LDI 

LDI 

LDI 

LDI 



@FFTSIZ,RC 

1,RC 

@FFTSIZ, IRO 

2,IR1 

@ INPUT, ARO 

@0UTPUT,AR1 



RC=N 

RC should be one less than desired # 

IR0=size of FFT=N 
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RPTB BITRV 

LDF *+ARO(l),RO 

|| LDF *AR0++(IR0)B,R1 

BITRV STF R0,*+AR1(1) 

|| STF Rl, *ARl++ (IR1) 

SELF BR SELF / Branch to itself at the end 

. end 
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Example 11-36. Table With Twiddle Factors for a 64-PointFFT 

* 

*TITL TABLE WITH TWIDDLE FACTORS FOR A 64-POINT FFT 

* FILE TO BE LINKED WITH THE SOURCE CODE FOR A 64-POINT, RADIX-2 FFT. 





.globl 


SINE 




.globl 


N 




.globl 


M 


N 


. set 


64 


M 


. set 
.data 


6 


SINE 








.float 


0.000000 




.float 


0.098017 




.float 


0.195090 




.float 


0.290285 




.float 


0.382683 




.float 


0.471397 




.float 


0.555570 




.float 


0.634393 




.float 


0.707107 




.float 


0.773010 




.float 


0.831470 




•float 


0.881921 




.float 


0.923880 




.float 


0.956940 




.float 


0.980785 




.float 


0.995185 


COSINE 








.float 


1.000000 




.float 


0.995185 




.float 


0.980785 




.float 


0.956940 




.float 


0.923880 




.float 


0.881921 




-float 


0.831470 




.float 


0.773010 




.float 


0.707107 




.float 


0.634393 




.float 


0.555570 




•float 


0.471397 




•float 


0.382683 




•float 


0.290285 




.float 


■ 0.195090 




.float 


0.098017 




•float 


0.000000 




•float 


-0.098017 




.float 


-0.195090 




.float 


-0.290285 




.float 


-0.382683 




.float 


-0.471397 




.float 


-0.555570 




.float 


-0.634393 




.float 


-0.707107 
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float 


-0. 


773010 


float 


-0. 


831470 


float 


-0. 


881921 


float 


-0. 


,923880 


float 


-0. 


,956940 


float 


-0. 


,980785 


float 


-0. 


,995185 


float 


-1. 


,000000 


float 


-0. 


,995185 


float 


-0. 


,980785 


float 


-0. 


,956940 


float 


-0. 


,923880 


float 


-0. 


.881921 


float 


-0. 


.831470 


float 


-0, 


.773010 


float 


-0. 


.707107 


float 


-0. 


.634393 


float 


-0, 


.555570 


float 


-0. 


.471397 


float 


-0, 


.382683 


float 


-0, 


.290285 


float 


-0, 


.195090 


float 


-0, 


.098017 


float 


0, 


.000000 


float 


0. 


.098017 


float 


0, 


.195090 


float 


0, 


.290285 


float 


0, 


.382683 


float 


0, 


.471397 


float 


0, 


.555570 


float 


0, 


.634393 


float 


0, 


.707107 


float 





.773010 


float 





.831470 


float 





.881921 


.float 





.923880 


float 


0, 


.956940 


float 





.980785 


.float 





.995185 



The radix-2 algorithm has tutorial value because it is relatively easy to under- 
stand how the FFT algorithm functions. However, radix-4 implementations can 
increase the speed of the execution by reducing the overall arithmetic required. 
Example 11-37 shows the generic implementation of a complex, DIF FFT in 
radix-4. A companion table, like the one in Example 1 1 -36, should have a val- 
ue of M equal to the logN, where the base of the logarithm is four. 



1 1 -72 Software Applica tions 



Application-Oriented Operations 



Example 11-37. Complex, Radix-4, DIFFFT 



TITL COMPLEX, RADIX-4, DIF FFT 

GENERIC PROGRAM TO DO A LOOPED-CODE RADIX-4 FFT COMPUTATION IN 
THE TMS320C3x. 

THE PROGRAM IS TAKEN FROM THE BURRUS AND PARKS BOOK, P. 117. 

THE (COMPLEX) DATA RESIDE IN INTERNAL MEMORY, AND THE COMPUTATION 

IS DONE IN-PLACE. 

THE TWIDDLE FACTORS ARE SUPPLIED IN A TABLE PUT IN A .DATA SECTION, 
THIS DATA IS INCLUDED IN A SEPARATE FILE TO PRESERVE THE GENERIC 
NATURE OF THE PROGRAM. FOR THE SAME PURPOSE, THE SIZE OF THE 
FFT N AND LOG4 (N) ARE DEFINED IN A . GLOBL DIRECTIVE AND SPECIFIED 
DURING LINKING. 

IN ORDER TO HAVE THE FINAL RESULT IN BIT-REVERSED ORDER, THE TWO 
MIDDLE BRANCHES, OF THE RADIX-4 BUTTERFLY ARE INTERCHANGED DURING 
STORAGE. NOTE THIS DIFFERENCE WHEN COMPARING WITH THE PROGRAM IN 
P. 117 OF THE BURRUS AND PARKS BOOK. 





.globl 


FFT • 




•globl 


N 




.globl 


M 




.globl 


SINE 




.usect 


"IN", IN, 1024 




• text 




* INITIALIZE 




TEMP 


.word 


$+2 


STORE 


.word 


FFTSIZ 




.word 


N 




.word 


M 




.word 


SINE 




.word 


INP 




• BSS 


FFTSIZ, 1 




.BSS 


LOGFFT, 1 




.BSS 


SINTAB, 1 




.BSS 


INPUT, 1 




.BSS 


STAGE, 1 




.BSS 


RPTCNT, 1 




• BSS 


IEINDX, 1 




• BSS 


LPCNT,1 




.BSS 


JT,1 




.BSS 


IA1 , 1 



Entry point for execution 

FFT size 

LOG4 (N) 

Address of sine table 

Memory with input data 



Beginning of temp storage area 



FFT size 

LOG4 (FFTSIZ) 

Sine/cosine table base 

Area with input data to process 

FFT stage # 

Repeat counter 

IE index for sine/cosine 

Second— loop count 

JT counter in program, P. 117 

IA1 index in program, P. 117 
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FFT: 



INITIALIZE DATA LOCATIONS 



LOOP 



LDP 


TEMP 


LDI 


0TEMP, ARO 


LDI 


@ STORE ,AR1 


LDI 


*AR0++,R0 


STI 


R0,*AR1++ 


LDI 


*AR0++,R0 


STI 


RO, *AR1++ 


LDI 


*AR0++,R0 


STI 


RO, *AR1++ 


LDI 


*AR0,R0 


STI 


RO, *AR1 


LDP 


FFTSIZ 


LDI 


@FFTSIZ,R0 


LDI 


@FFTSIZ, IRO 


LDI 


@FFTSIZ,IR1 


LDI 


0,AR7 


STI 


AR7, @ STAGE 


LSH 


1,IR0 


LSH 


-2, IR1 


LDI 


1,AR7 


STI 


AR7, @RPTCNT 


LSH 


-2,R0 


STI 


AR7, @IEINDX 


ADD I 


2,R0 


STI 


RO, @JT 


SUBI 


2,R0 


LSH 


1,R0 


OUTER LOOP 




P: 

LDI 


@ INPUT, ARO 


ADDI 


RO, AR0,AR1 


ADD I 


RO, AR1,AR2 


ADDI 


R0,AR2,AR3 


LDI 


@RPTCNT,RC 


SUBI 


1,RC 


FIRST LOOP 




RPTB 


BLK1 


ADDF 


*+AR0, *+AR2,Rl 



ADDF 



*+AR3, *+ARl,R3 



ADDF 


R3,R1,R6 


SUBF 


*+AR2, *+AR0,R4 


STF 


R6, *+AR0 


SUBF 


R3,R1 


LDF 


*AR2,R5 


LDF 


*+ARl,R7 


ADDF 


*AR3, *AR1,R3 


ADDF 


R5, *AR0,R1 


STF 


Rl, *+ARl 


ADDF 


R3,R1,R6 


SUBF 


R5, *AR0,R2 


STF 


R6, *AR0++(IR0) 



Command to load data page counter 



Xfer data from one memory to the other 



Command to load data page pointer 



@STAGE holds the current stage number 
IR0=2*N1 (because of real/imag) 
IRl=N/4, pointer for SIN/COS table 

Initialize repeat counter of first loop 

Initialize IE inde:-: 

JT=P,0/2+2 

R0=N2 



ARO points to X(I) 
AR1 points to X(I1) 
AR2 points to X(I2) 
AR3 points to X(I3) 

RC should be one less than desired # 



R1=Y(I) +Y(I2) 

R3=Y(I1)+Y(I3) 
R6=R1+R3 

R4=Y(I)-Y(I2) 

Y(I)=R1+R3 

R1=R1-R3 

R5=X(I2) 

R7=Y(I1) 

R3=X(I1)+X(I3) 

R1=X(I)+X(I2) 

Y(I1)=R1-R3 

R6=R1+R3 

R2=X(I)-X(I2) 

X(I)=R1+R3 
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SUBF 


R3,R1 




R1=R1-R3 




SUBF 


*AR3 , *AR1 


, R6 


R6=X(I1)-X(I3) 




SUBF 


R7,*+AR3, 


R3 


• -R3=Y(I1)-Y(I3) 


1 1 


STF 


Rl, *AR1++(IR0) 


• X(I1)=R1-R3 




SUBF 
ADDF 


R6,R4,R5 
R6,R4 




• R5=R4-R6 

• R4=R4+R6 


1 1 


STF 
STF 


R5, *+AR2 
R4, *+AR3 




Y(I2)=R4-R6 
Y(I3)=R4+R6 




SUBF 
ADDF 


R3,R2,R5 
R3,R2 




R5=R2-R3 
R2=R2+R3 


BLK1 
1 1 


STF 
STF 


R5,*AR2++(IR0) 
R2,*AR3++(IR0) 


X(I2)=R2-R3 
■ X(I3)=R2+R3 



IF THIS IS THE LAST STAGE, YOU ARE DONE 



INLOP 



LDI 


@ STAGE ,AR7 


ADD I 


1,AR7 


CMP I 


@LOGFFT,AR7 


BZD 


END 


STI 


AR7,@ STAGE 


J INNER 


LOOP 


LDI 


1,AR7 


STI 


AR7,@IA1 


LDI 


2,AR7 


STI 


AR7,@LPCNT 


LDI 


2,AR6 


ADD I 


@LPCNT,AR6 


LDI 


@LPCNT,AR0 


LDI 


@IA1,AR7 


ADD I 


@IEINDX,AR7 


ADD I 


@INPUT,AR0 


STI 


AR7,@IA1 


ADD I 


R0,AR0,AR1 


STI 


AR6,@LPCNT 


ADD I 


R0,AR1,AR2 


ADDI 


R0,AR2,AR3 


LDI 


@RPTCNT,RC 


SUBI 


1,RC 


CMP I 


@JT,AR6 


BZD 


SPCL 


LDI 


@IA1,AR7 


LDI 


@IA1,AR4 


ADDI 


@SINTAB,AR4 


ADDI 


AR4 , AR7 , AR5 


SUBI 


1,AR5 


ADDI 


AR7,AR5,AR6 


SUBI 


1,AR6 



/ Current FFT stage 

; Init IA1 index 

; Init loop counter for inner loop 

; Increment inner loop counter 



IA1=IA1+IE 

(X(I) ,Y(I) ) pointer 

(X(I1) ,Y(I1) ) pointer 

(X(I2) , Y(I2) ) pointer 
(X(I3) ,Y(I3) ) pointer 

RC should be one less than desired # 
If LPCNT=JT, go to 
special butterfly. 



; Create cosine index AR4 
; IA2=IA1+IA1-1 
; IA3=IA2+IA1-1 
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I I 



I I 



; SECOND LOOP 






RPTB 


BLK2 






ADDF 


*+AR2,*+AR0,R3 




• R3=Y(I)+Y(I2) 


ADDF 


*+AR3,*+ARl,R5 




• R5=Y(I1)+Y(I3) 


ADDF 


R5,R3,R6 




• R6=R3+R5 


SUBF 


*+AR2,*+AR0,R4 




R4=Y(I)-Y(I2) 


SUBF 


R5,R3 




R3=R3-R5 


ADDF 


*AR2,*AR0,R1 




R1=X(I)+X(I2) 


ADDF 


*AR3,*AR1,R5 




R5=X(I1)+X(I3) 


MPYF 


R3,*+AR5(IR1) , 


R6 1 


^6=R3*C02 


STF 


R6,*+AR0 




Y(I)=R3+R5 


ADDF 


R5,R1,R7 




R7=R1+R5 


SUBF 


*AR2 , *AR0 , R2 




R2=X(I)-X(I2) 


SUBF 


R5,R1 




R1=R1-R5 


MPYF 


Rl, *AR5,R7 




R7=R1*SI2 


STF 


R7, *AR0++(IR0) 




X(I)=R1+R5 


SUBF 


R7,R6 




R6=R3*C02-R1*SI2 


SUBF 


*+AR3,*+ARl,R5 




R5=Y(I1)-Y(I3) 


MPYF 


Rl, *+AR5 (IR1) , 


R7 , 


R7=R1*C02 


STF 


R6,*+AR1 




Y (11) =R3*C02-R1*SI2 


MPYF 


R3,*AR5,R6 




R6=R3*SI2 


ADDF 


R7,R6 




R6=R1*C02+R3*SI2 


ADDF 


R5,R2,R1 




R1=R2+R5 


SUBF 


R5,R2 




• R2=R2-R5 


SUBF 


*AR3,*AR1,R5 




R5=X(I1)-X(I3) 


SUBF 


R5,R4,R3 




• R3=R4-R5 


ADDF 


R5,R4 




• R4=R4+R5 


MPYF 


R3,*+AR4 (IR1) , 


R6 


■ R6=R3*C01 


STF 


R6,*AR1++(IR0) 




• X(I1)=R1*C02+R3*SI2 


MPYF 


Rl , *AR4 , R7 




• R7=R1*SI1 


SUBF 


R7,R6 




• R6=R3*C01-R1*SI1 


MPYF 


R1,*+AR4 (IR1) , 


R6 


• R6=R1*C01 


STF 


R6,*+AR2 




■ Y(I2)=R3*C01-R1*SI1 


MPYF 


R3,*AR4,R7 




• R7=R3*SI1 


ADDF 


R7,R6 




• R6=R1*C01+R3*SI1 


MPYF 


R4,*+AR6(IR1) , 


R6 


• R6=R4*C03 


STF 


R6,*AR2++(IR0) 




• X(I2)=R1*C01+R3*SI1 


MPYF 


R2,*AR6,R7 




• R7=R2*SI3 


SUBF 


R7,R6 




• R6=R4*C03-R2*SI3 


MPYF 


R2,*+AR6(IR1) , 


R6 


; R6=R2*C03 


STF 


R6,*+AR3 




• Y(I3)=R4*C03-R2*SI3 


MPYF 


R4,*AR6,R7 




; R7=R4*SI3 


ADDF 


R7,R6 




; R6=R2*C03+R4*SI3 
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BLK2 STF 



R6,*AR3++(IR0) 



CMP I @LPCNT,RO 
BP INLOP 
BR CONT 

SPECIAL BUTTERFLY FOR W=J 



; x(i3)=R2*C03+R4*Sl3 

; LOOP BACK TO THE INNER LOOP 



SPCL 



I I 



I I 



BLK3 



LDI 


IR1,AR4 




LSH 


-1,AR4 


• Point to SIN (45) 


ADD I 


@SINTAB,AR4 


• Create cosine index 


RPTB 


BLK3 




ADDF 


*AR2, *AR0,R1 


• R1=X(I)+X(I2) 


SUBF 


*AR2 , *AR0 , R2 


• R2=X(I)-X(I2) 


ADDF 


*+AR2, *+AR0,R3 


• R3=Y(I)+Y(I2) 


SUBF 


*+AR2, *+AR0,R4 


■ R4=Y(I)-Y(I2) 


ADDF 


*AR3,*AR1,R5 


• R5=X(I1)+X(I3) 


SUBF 


R1,R5,R6 


■ R6=R5-R1 


ADDF 


R5,R1 


R1=R1+R5 


ADDF 


*+AR3,*+ARl,R5 


R5=Y(I1)+Y(I3) 


SUBF 


R5,R3,R7 


R7=R3-R5 


ADDF 


R5,R3 


R3=R3+R5 


STF 


R3, *+AR0 


Y(I)=R3+R5 


STF 


R1,*AR0++(IR0) 


X(I)=R1+R5 


SUBF 


*AR3,*AR1,R1 


R1=X(I1)-X(I3) 


SUBF 


*+AR3,*+ARl,R3 


R3=Y(I1)-Y(I3) 


STF 


R6,*+AR1 


Y(I1)=R5-R1 


STF 


R7,*AR1++(IR0) 


X(I1)=R3-R5 


ADDF 


R3,R2,R5 


R5=R2+R3 


SUBF 


R2 , R3 , R2 


R2=-R2+R3 


SUBF 


R1,R4,R3 


• R3=R4-R1 


ADDF 


R1,R4 


■ R4=R4+R1 


SUBF 


R5,R3,R1 


• R1=R3-R5 


MPYF 


*AR4,R1 


• R1=R1*C021 


ADDF 


R5,R3 


• R3=R3+R5 


MPYF 


*AR4,R3 


• R3=R3*C021 


STF 


R1,*+AR2 


■ Y(I2) = (R3-R5) *C021 


SUBF 


R4,R2,R1 


• R1=R2-R4 


MPYF 


*AR4,R1 


• R1=R1*C021 


STF 


R3, *AR2++(IR0) 


• X(I2)=(R3+R5) *C021 


ADDF 


R4,R2 


• R2=R2+R4 


MPYF 


*AR4 , R2 


• R2=R2*C021 


STF 


Rl, *+AR3 


• Y(I3)=-(R4-R2) *C021 


STF 


R2, *AR3++(IR0) 


• X(I3) = (R4+R2)*C021 


CMP I 


@LPCNT,R0 




BPD 


INLOP 


• Loop back to the ini 
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CONT 



LDI 
LDI 
LSH 

STI 

LSH 

STI 

LDI 

LSH 

ADD I 

STI 

SUBI 

LSH 

BR 



@RPTCNT,AR7 
@IEINDX,AR6 
2 , AR7 

AR7,@RPTCNT 

2,AR6 

AR6,@IEINDX 

R0,IR0 

-3,R0 

2,R0 

RO,@JT 

2,R0 

1,R0 

LOOP 



Increment repeat counter for 
next time 

IE=4*IE 

N1=N2 



JT=N2/2+2 

N2=N2/4 

Next FFT stage 



STORE RESULT OUT USING BIT-REVERSED ADDRESSING 



END 



LDI 

SUBI 

LDI 

LDI 

LDI 

LDP 

LDI 



RPTB 

LDF 

I | LDF 

BITRV STF 

I | STF 



SELF- 



BR 
.end 



@FFTSIZ,RC 

1,RC 

@FFTSIZ, IRO 

2,IR1 

@ INPUT, ARO 

STORE 

@ STORE, AR1 

BITRV 

*+AR0(l) ,R0 
*AR0++(IR0)B,R1 
R0,*+AR1 (1) 
R1,*AR1++(IR1) 

SELF 



RC=N 

RC should be one less than desired # 

IR0=size of FFT=N 



Branch to itself at the end. 



Most often, the data to be transformed is a sequence of real numbers. In this 
case, the FFT demonstrates certain symmetries that permit the reduction of 
the computational load even further. Example 1 1 -38 shows the generic imple- 
mentation of a real-valued, radix-2 FFT. For such an FFT, the total storage re- 
quired for a length-N transform is only N locations; in a complex FFT, 2N are 
necessary. Recovery of the rest of the points is based on the symmetry condi- 
tions. 
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Example 11-38. Real, Radix-2FFT 



TITL REAL, RADIX-2 FFT 

GENERIC PROGRAM TO DO A RADIX-2 REAL FFT COMPUTATION IN TMS320C3x. 

THE PROGRAM IS TAKEN FROM THE PAPER BY SORENSEN ET AL., JUNE 1987 
ISSUE OF THE TRANSACTIONS ON ASSP . 

THE REAL DATA RESIDE IN INTERNAL MEMORY. THE COMPUTATION IS 
DONE IN-PLACE. THE BIT-REVERSAL IS DONE AT THE BEGINNING OF 
THE PROGRAM. 

THE TWIDDLE FACTORS ARE SUPPLIED IN A TABLE PUT IN A .DATA 
SECTION. THIS DATA IS INCLUDED IN A SEPARATE FILE TO PRESERVE 
THE GENERIC NATURE OF THE PROGRAM. FOR THE SAME PURPOSE, THE 
SIZE OF THE FFT N AND LOG2 (N) ARE DEFINED IN A . GLOBL DIRECTIVE 
AND SPECIFIED DURING LINKING. THE LENGTH OF THE TABLE IS 
N/4 + N/4 = N/2. 



,globl FFT 

, globl N 

.globl M 

, globl SINE 



.bss 



INP,1024 



Entry point for execution 

FFT size 

LOG2 (N) 

Address of sine table 

Memory with input data 



.text 



* 


INITIALIZE 




FFTSIZ 


.word 


N 


LOGFFT 


• word 


M 


SINTAB 


.word 


SINE 


INPUT 


• word 


INP 


FFT 


: 


LDP 


FFTSIZ 


* 


DO 


THE BIT- 


-REVERSING AT 






LDI 


@FFTSIZ,RC 






SUBI 


1,RC 






LDI 


@FFTSIZ,IRO 






LSH 


-1,IR0 






LDI 


@ INPUT, ARO 






LDI 


@INPUT,AR1 






RPTB 


BITRV 






CMP I 


AR1 , ARO 






BGE 


CONT 






LDF 


*AR0,R0 


1 1 




LDF 


*AR1,R1 






STF 


RO, *AR1 


1 1 




STF 


Rl, *AR0 



Command to load data page printer 



RC=N 

RC should be one less than desired 

IR0=half the size of FFT=N/2 



Exchange locations only 
if ARCKAR1 
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CONT NOP 
BITRV NOP 



*AR0++ 
*AR1++(IR0)B 



LENGTH-TWO BUTTERFLIES 



LDI 
LDI 
SUBI 



@ INPUT, ARO 

IR0,RC 

1,RC 



ARO points to X(I) 

Repeat N/2 times 

RC should be one less than desired # 



RPTB 
ADDF 



SUBF 



BLK1 STF 
I | STF 



BLK1 

*+AR0, *AR0++,R0 

*AR0,*-AR0,R1 

RO , *-AR0 
R1,*AR0++ 



R0=X(I)+X(I+1) 

R1=X(I)-X(I+1) 
X(I)=X(I)+X(I + 1) 
X(I + 1)=X(I)-X(I + 1) 



* FIRST PASS OF THE DO-20 LOOP (STAGE K=2 IN DO-10 LOOP) 



LDI 


@ INPUT, ARO 


LDI 


2,IR0 


LDI 


@FFTSIZ,RC 


LSH 


-2,RC 


SUBI 


1, RC 


RPTB 


BLK2 


ADDF 


*+AR0(IR0) 



SUBF 
* 

NEGF 
I | STF 
BLK2 STF 



; ARO points to X(I) 

; IR0=2=N2 

; Repeat N/4 times 

; RC should be one less than desired # 



; R0=X(I)+X(I+2) 
*AR0,*-AR0(IR0) ,R1 

Rl=X(I)-X(I+2) 



*+AR0,R0 
R0,*-AR0 (IRO) 
R1,*AR0++(IR0) 



STF R0,*+AR0 
MAIN LOOP (FFT STAGES) 



R0=-X(I+3) 

X ( I ) =X ( I ) +X ( 1+2 ) 

X(I+2)=X(I)-X(I+2) 
X(I+3)=-X(I+3) 





LDI 


@FFTSIZ, 


IRO 






LSH 


-2, IRO 




IR0=inde:-: for E 




LDI 


3,R5 




R5 holds the current stage number 




LDI 


1,R4 




R4=N4 




LDI 


2,R3 




R3=N2 


LOOP' 


LSH 


-1, IRO 




E=E/2 




LSH 


1,R4 




N4=2*N4 




LSH 


1,R3 




• N2=2*N2 



INNER LOOP, (DO-2 LOOP IN THE PROGRAM) 

; AR5 points to X(I) 



LDI 
INLOP LDI 
ADD I 
LDI 

LDI 
ADD I 
LDI 



@ INPUT, AR5 

IRO, ARO 

@SINTAB,ARO ; ARO points to SIN/COS table 

R4,IR1 ; IR1=N4 



AR5 , AR1 

1,AR1 

AR1,AR3 



; AR1 points to X (II) =X (I+J) 
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Example 11-38. Real, Radix-2 FFT (Concluded) 



ADD I 
LDI 
SUBI 
ADD I 

LDF 

ADDF 

SUBF 

STF 

NEGF 

NEGF 

STF 
STF 



R3 , AR3 

AR3 , AR2 

2,AR2 

R3 , AR2 , AR4 

*AR5++(IR1) ,R0 

*+AR5 (IR1) ,R0,R1 



AR3 points to X (13) =X (I+J+N2) 

AR2 points to X (12) =X (I-J+N2) 

AR4 points to X (14) =X (I-J+Nl) 

R0=X(I) 



R1=X(I)+X(I+N2) 



R0,*++AR5(IR1) , RO 



LDI 
LSH 
LDI 
SUBI 

RPTB 
MPYF 

MPYF 

MPYF 

I | ADDF 

MPYF 

SUBF 
SUBF 
ADDF 

I | STF 
ADDF 

I | STF 
SUBF 

I | STF 

BLK3 STF 

SUBI 
ADD I 
CMP I 
BLTD 
ADD I 
NOP 
NOP 



R1,*-AR5 (IR1) 

RO 

*++AR5 (IR1) ,R1 

R0,*AR5 
R1,*AR5 

* INNERMOST LOOP 

@FFTSIZ,IR1 
-2,IR1 
R4,RC 
2,RC 



R0=-X(I)+X(I+N2) 

X(I)=X(I)+X(I+N2) 

R0=X(I)-X(I+N2) 

R1=-X(I+N4+N2) 
X(I+N2)=X(I)-X(I+N2) 
X ( I+N4+N2 ) =-X ( I+N4+N2 ) 



; IRl=separation between SIN/COS tbls 
; Repeat N4— 1 times 



BLK3 

*AR3, *+AR0 (IR1) , 

*AR4,*AR0,R1 
*AR4,*+AR0(IR1) , 
RO , Rl , R2 
*AR3,*AR0++(IR0) 

R0,R1,R0 

*AR2,R0,R1 

*AR2,R0,R1 

R1,*AR3++ 

*AR1,R2,R1 

R1,*AR4-- 

R2,*AR1,R1 

R1,*AR1++ 

R1,*AR2-- 

@ INPUT, AR5 

R4, AR5 

@FFTSIZ,AR5 

INLOP 

@ INPUT, AR5 



RO 

R0=X(I3) *COS 

R1=X(I4)*SIN 
Rl ; Rl=OC(I4) *COS 

R2=X(I3) *COS+X.(I4)*SIN 
,R0 

R0=X(I3) *SIN 

R0=-X(I3) *SIN+X(I4) *COS 

R1=-X(I2)+R0 

R1=X(I2)+R0 

X(I3)=-X(I2)+R0 

R1=X(I1)+R2 

X(I4)=X(I2)+R0 

R1=X(I1)-R2 

X(I1)=X(I1)+R2 

X(I2)=X(I1)-R2 

AR5=I+N1 

Loop back to the inner loop 



END 



ADD I 
CMP I 
BLE 

BR 
.end 



1,R5 

@LOGFFT,R5 

LOOP 

END 



; Branch to itself at the end, 
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The TMS320C3x quickly executes FFT lengths up to 1024 points (complex) 
or 2048 (real), covering most applications, because it can do so almost entirely 
in on-chip memory. Table 11-1 summarizes the execution time required for 
FFT lengths between 64 and 1024 points for the three algorithms in 
Example 11-35, 11-37, and 11-38. 



Table 11-1. TMS320C3x FFT Timing Benchmarks 



Number 
of Points 


RADIX-2 
(Complex) 


FFT Timing 

in milliseconds 

RADIX-4 

(complex) 


RADIX-2 
(real) 


64 
128 
256 
512 
1024 


0.165 
0.370 
0.816 
1.784 
3.873 


0.123 
0.624 
3.040 


0.077 
0.174 
0.387 
0.857 
1.879 


1024* 


2.366 







Code is found in Digital Signal Processing Applications With the TMS320 Family, 
Volume 3. 



11.4.5 Lattice Filters 



The lattice form is an alternative way of implementing digital filters; it has found 
applications in speech processing, spectral estimation, and other areas. In this 
discussion, the notation and terminology from speech processing applications 
are used. 

If H(z) is the transfer function of a digital filter that has only poles, A(z) = 1/H(z) 
will be a filter having only zeros, and it will be called the inverse filter. The in- 
verse lattice filter is shown in Figure 11-5. These equations describe the filter 
in mathematical terms: 

f {i,ri) = f(i- 1,n) + k (/') b (/ -1 ,n -1 ) 
b {i,n) = b (i-i,n -1) + k (/') /(/-1 ,n) 

Initial conditions: 
f(0,n) = b(0,n) = x(n) 

Final conditions: 
y{n) = f{p,n). 

In the above equation, f(i,n) is the forward error, b {i,n) is the backward error, 
k (/' ) is the /-th reflection coefficient, x (n) is the input, and y (n) is the output 
signal. The order of the filter (i.e., the number of stages) is p. In the linear pre- 
dictive coding (LPC) method of speech processing, the inverse lattice filter is 
used during analysis, and the (forward) lattice filter during speech synthesis. 



11-82 



Software Applications 



Application-Oriented Operations 



ifeWSSSSSSSftSrWrWWStW; 



Figure 11-5. Structure of the Inverse Lattice Filter 



x(n) f(0, n) 



b(0, n) 



z-1 




f(1,n^ 



b(1,n) 



z-1 




„ f(p-1,J?) 



-^v— -^ 



b(p-1,n) 



z-1 



f(p, n) = y(n) 




Figure 1 1 -6 shows the data memory organization of the inverse lattice-filter on 
the TMS320C3X. 



Figure 11-6. Data Memory Organization for Lattice Filters 





Reflection 
Coefficients 




Backward 
Propagation Terms 


Low 
Address 


k(1) 




b(0, n-1) 




k(2) 


b(1,n-1) 




• 
• 
• 




a 
o 
o 


High 
Address 


k(p) 


b(p-1,n-1) | 



11-83 



Application-Oriented Operations 



y.*WrWSS*WMWSSSSSSSSrWSf 



wwa^^SfWrwrtwftWfcwsftW* 



Example 11-39. Inverse Lattice Filter 



'*• TITL INVERSE LATTICE FILTER 

* 

* SUBROUTINE L A T I N V 



LATINV 



LATTICE FILTER (LPC INVERSE FILTER - ANALYSIS) 



TYPICAL CALLING SEQUENCE: 



load 


R2 


load 


ARO 


load 


AR1 


load 


RC 


CALL 


LATINV 



ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 

+ 



R2 

ARO 

AR1 

RC 



f (0,n) = x(n) 

ADDRESS OF FILTER COEFFICIENTS (k(l)) 

ADDRESS OF BACKWARD PROPAGATION 

VALUES (b(0,n-l)) 
RC = p - 2 



REGISTERS USED AS INPUT: R2, ARO, AR1, RC 

REGISTERS MODIFIED: RO, Rl, R2, R3, RS, RE, RC, ARO, AR1 

REGISTER CONTAINING RESULT: R2 (f(p,n)) 



PROGRAM SIZE: 10 WORDS 
EXECUTION CYCLES: 13 + 3 * (p-1) 



■global LATINV 



LATINV MPYF3 *AR0, *AR1, RO 



LDF 
MPYF3 



R2 , R3 

*AR0++(1) ,R2,R1 



k(l) * b(0,n-l) -> RO 

Assume f (0,n) -> R2 . 

Put b(0,n) = f (0,n) -> R3 . 

k(l) * f (0,n) -> Rl 
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* 2 <= i <= p 
* 

RPTB LOOP 

MPYF3 *AR0,*++AR1(1) , RO ;k(i) * b(i-l,n-l) -> RO 

II ADDF3 R2,R0,R2 ; f (i-1-1, n) +k (i-1) 

* / *b (i-1-1, n-1) 

* ; = f(i-l,n) -> R2 

* ; b (i-1-1, b-l)+k (i-1) *f (i-1-1, n) 
ADDF3 *-ARl(l), Rl, R3 ; = b(i-l,n) -> R3 

II STF R3, *-ARl(l) ; b(i-l-l,n) -> b ( i-1-1 , n-1 ) 



LOOP MPYF3 *AR0++(1) ,R2,R1 
* 

* I = P+l (CLEANUP) 



k(i) * f (i-l,n) -> Rl 



I I 



ADDF3 R2,R0,R2 




; f (p-l,n)+k(p)*b(p-l,n-l) 
; = f (p,n) -> R2 

; b(p-l,n-l)+k(p)*f (p-l,n) 


ADDF3 *AR1, Rl, 


R3 


; = b (p,n) -> R3. 


STF R3, *AR1 




; b(p-l,n) -> b(p-l,n-l) 


RETURN SEQUENCE 






RETS 




; RETURN 


end 






.end 







The forward lattice filter is similar in structure to the inverse filter, as shown in 
Figure 11-7. These corresponding equations describe the lattice filter: 

f (/ -1 , n) = f (\,n) - k (/■) b (i -1 ,n -1 ) 
b (i,n) = b (/ -1 ,n--\) + k (/) /(/-1 ,n) 

Initial conditions: 

f{p,n) =x(/7),6(i,/7-1)=0. for/ =1,...,p 

Final conditions: 
y(n)=f(0,n). 

The data memory organization is identical to that of the inverse filter, as shown 
in Figure 11-6. Example 11-40 shows the implementation of the lattice filter 
on the TMS320C3X. 
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Figure 11-7. Structure of the (Forward) Lattice Filter 




Example 11-40. Lattice Filter 

* TITL LATTICE FILTER 



SUBROUTINE L A T ICE 

LOAD ARO 

LOAD AR1 

LOAD RC 

CALL LATICE 



ARGUMENT ASSIGNMENTS: 
ARGUMENT | FUNCTION 

4- 



R2 
ARO 

AR1 
IRO 
RC 



F(P,N) = E(N) = EXCITATION 

ADDRESS OF FILTER COEFFICIENTS (K(P)) 

ADDRESS OF BACKWARD PROPAGATION VALUES (B(P-1,N-1)) 

3 

RC = P - 3 



REGISTERS USED AS INPUT: R2, ARO, AR1, RC 

REGISTERS MODIFIED: RO, Rl, R2, R3, RS, RE, RC, ARO, AR1 

REGISTER CONTAINING RESULT: R2 (f(0,n)) 

STACK USAGE: NONE 

PROGRAM SIZE: 12 WORDS 

EXECUTION CYCLES: 15 + 3 * (P-2) 

•global LATICE 



LATICE MPYF3 



I I 



I I 



SUBF3 
MPYF3 

SUBF3 
MPYF3 

MPYF3 
ADDF3 



*AR0,*AR1,R0 

K(P) * B(P-1,N-1) -> RO 

ASSUME F(P,N) -> R2. 
R0,R2,R2 ; F (P, N) -K (P) *B (P-l, N-l) =F(P-1,N) ->R2 
* — ARO(l) ,* — AR1 (1) ,R0 

; K(P-l) * B (P-2, N-l) -> RO 
R0,R2,R2 ; F (P-1,N) -K (P-l) *B (P-2, N-l) =F(P-2,N) -> R2 

*--AR0 (1) ,*--ARl(l) ,R0 

; K(P-2) * B(P-3,M-1) -> RO 

R2,*+AR0(1) ,R1 ; F(P-2,N) * K(P-l) -> Rl 

R1,*+AR1(1) ,R3 ; F(P-2,N) * K(P-l) + B(P-2,N-1) = B(P-1,N) -> R3 
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RPTB 

SUBF3 
I | MPYF3 

STF 
I | MPYF3 
LOOP ADDF3 
STF 
STF 



1 <= I <= P-2 



LOOP 

R0,R2,R2 ; F(I,N) 
*- -r&RO ( 1 ) , *- -AR1 ( 1 ) , RO 
K(I-l) 
R3,*+AR1 (IRO) 
R2, *+AR0 (1) ,R1 
R1,*+AR1 (1) ,R3 
R3,*+AR1 (2) 
R2,*+AR1 (1) 



K(I) * B(I-1,N-1) =F(I-1,N) -> R2 

B(I-2,N-1) -> RO 
B(I+1,N) -> B(I+1,N-1) 
F(I-1,N) * K(I) -> Rl 

F(I-1,N) * K(I) + B(I-1,N-1) =B(I,N) -> R3 
B(1,N) -> B(1,N-1) 
F(0,N) -> B(0,N-1) 



RETURN SEQUENCE 

RETS 
end 

.end 
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11.5 Programming Tips 



Programming style is highly personal and reflects each individual's prefer- 
ences and experiences. The purpose of this section is not to impose any partic- 
ular style. Instead, it emphasizes some of the features of the TMS320C3x that 
can help in producing faster and/or shorter programs. The tips cover both C 
compiler and assembly language programming. 



11.5.1 C-Callable Routines 



The TMS320C3x was designed with a large register file, software stack, and 
large memory space in order to implement a high-level language (HLL) compil- 
er easily. The first such implementation supplied is a C compiler. Use of the C 
compiler increases the transportability of applications that have been tested 
on large, general-purpose computers, and decreases their porting time. 

For best usage of the compiler, complete the following steps: 

1 ) Write the application in the high-level language. 

2) Debug the program. 

3) Estimate if it runs in real-time. 

4) If it doesn't, identify places where most of the execution time is spent. 

5) Optimize these areas by writing assembly language routines that imple- 
ment the functions. 

6) Call the routines from the C program as C functions. 

When writing a C program, you can increase the execution speed by maximiz- 
ing the use of register variables. For more information, refer to the TMS320C3x 
C Compiler Reference Guide. 

Certain conventions must be observed in writing a C-callable routine. These 
conventions are outlined in the Runtime Environment chapter of the 
TMS320C3x C Compiler Reference Guide. Certain registers are saved by the 
calling function, and others need to be saved by the called function. The C com- 
piler manual helps achieve a clean interface. The end result is the readability 
and natural flow of a high-level language combined with the efficiency and spe- 
cial-feature use of assembly language. 



11.5.2 Hints for Assembly Coding 



Each program has particular requirements. Not all possible optimizations will 
make sense in every case. The suggestions presented in this section can be 
used as a checklist of available software tools. 
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Q Use delayed branches. Delayed branches execute in a single cycle; reg- 
ular branches execute in four. The following three instructions are also ex- 
ecuted whether the branch is taken or not. If fewer than three instructions 
can be used, use the delayed branch and append NOPs. Machine cycles 
(time) are still being saved. 

□i Apply the repeat single/block construct. In this way, loops are achieved 
with no overhead. Nesting such constructs will not normally increase effi- 
ciency, so try to use the feature on the most often performed loop. Note 
that RPTS is not interruptible, and the executed instruction is not refetched 
for execution. This frees the buses for operands. 

Q Use parallel instructions. It is possible to have a multiply in parallel with 
an add (or subtract) and to have stores in parallel with any multiply or ALU 
operation. This increases he number of operations executed in a single 
cycle. For maximum efficiency, observe the addressing modes used in 
parallel instructions and arrange the data appropriately. 

Q Maximize the use of registers. The registers are an efficient way to ac- 
cess scratch-pad memory. Extensive use of the register file facilitates the 
use of parallel instructions and helps avoid pipeline conflicts when you use 
the registers in addressing modes. 

Q Use the cache. Especially in conjunction with external slow memory. The 
cache is transparent to the user, so make sure that it is enabled. 

□i Use internal memory instead of external memory. The internal 
memory (2K x 32 bits RAM and 4K x 32 bits ROM) is considerably faster 
to access. In a single cycle, two operands can be brought from internal 
memory. You can maximize performance if you use the DMA in parallel 
with the CPU to transfer data to internal memory before you operate on 
them. 

Qi Avoid pipeline conflicts. If there is no problem with program speed, ig- 
nore this suggestion. For time-critical operations, make sure that cycles 
are not missed because of conflicts. To identify conflicts, run the trace func- 
tion on the developmenttools (simulator, emulators) with the program trac- 
ing option enabled. The tracing immediately identifies the pipeline con- 
flicts. Consult the appropriate section of this User's Guide for an explana- 
tion of the reason for the conflict. You can then take steps to correct the 
problem. 

The above checklist is not exhaustive, and it does not address the more de- 
tailed features outlined in the different sections of this manual. To learn how 
to exploit the full power of the TMS320C3x, Tl recommends careful study of 
the architecture, hardware configuration, and instruction set of the device, de- 
scribed in earlier chapters. 
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Chapter 12 

Hardware Applications 



The TMS320C3x's advanced interface design can be used to implement a 
wide variety of system configurations. Its two external buses and DMA capabil- 
ity provide a parallel 32-bit interface to external devices, while the interrupt in- 
terface, dual serial ports, and general-purpose digital I/O provide communica- 
tion with a muJtitude of peripherals. 

This chapter describes how to use the TMS320C3x's interfaces to connect to 
various external devices. Specific discussions include implementation of par- 
allel interface to devices with and without wait states, use of general-purpose 
I/O, and system control functions. All interfaces shown in this chapter have 
been built and tested to verify proper operation and apply to the 
TMS320C30-33. Comparable designs for the other TMS320C3x devices can 
be implemented with appropriate logic. 

Major topics discussed in this chapter are as follows: 

Qi System Configuration Options Overview (Section 12.1 on page 12-2) 

Qi Primary Bus Interface (Section 12.2 on page 12-4) 

■ Zero Wait-State Interface to RAMs 
a Ready Generation 

o Bank Switching Techniques 

Qi Expansion Bus Interface (Section 12.3 on page 12-18) 

■ A/D Converter Interface 

■ D/A Converter Interface 

Qi System Control Functions (Section 12.4 on page 12-25) 

B Clock Oscillator Circuitry 
B Reset Signal Generator 

Qi Serial Port Interface (Section 12.5 on page12-30) 

Oil User Target Design Considerations When Using the XDS1000 
(Section 12.6 on page 12-34) 

□ User Target Design Considerations When Using the Hewlett Package 
64776 Analysis Subsystem (Section 12.7 on page 12-37). 

Qi TMS320C30 and TMS320C31 Differences (Section 1 2.8 on page 1 2-41 ). 
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12.1 System Configuration Options Overview 



The various TMS320C3x interfaces connect to a wide variety of different de- 
vice types. Each of these interfaces is tailored to a particular family of devices. 



12.1.1 Categories of Interfaces on the TMS320C3x 



The interface types on the TMS320C3x fall into several different categories, 
depending on the devices to which they are intended to be connected. Each 
interface comprises one or more signal lines that transfer information and con- 
trol its operation. Shown in Figure 12-1 are the signal line groupings for each 
of these various interfaces. 



Figure 12-1. External Interfaces on the TMS320C3x 
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All of the interfaces are independent of one another and different operations 
may be performed simultaneously on each interface. 

The primary and expansion buses implement the memory-mapped interface 
to the device. The external DMA interface allows external devices to cause the 
processor to relinquish the primary bus and allow direct memory access. 
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12.1.2 Typical System Block Diagram 



The devices that can be interfaced to the TMS320C3x include memory, DMA 
devices, and numerous parallel and serial peripherals and I/O devices. 
Figure 1 2-2 illustrates a typical configuration of a TMS320C3x system with dif- 
ferent types of external devices and the interfaces to which they are connected. 



Figure 12-2. Possible System Configurations 
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This block diagram constitutes essentially a fully expanded system. In an ac- 
tual design, any subset of the illustrated configuration may be used as appro- 
priate. 
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12.2 Primary Bus Interface 



The TMS320C3x uses the primary bus to access the majority of its 
memory-mapped locations. Therefore, typically, when a large amount of exter- 
nal memory is required in a system, it is interfaced to the primary bus. The ex- 
pansion bus (discussed in Section 1 2.3 on page 1 2-1 8) a ctua lly compr ises two 
mutually exclusive interfaces, controlled by the MSTRB an d IOSTR B signals, 
respectively. Cycles on the expansion bus controlled by the MSTRB signal are 
essentially equivalent to cycles on the primary bus, with the exception that 
bank switching is not implemented on the expansion bus. Acco rdingly, t he dis- 
cussion of primary bus cycles in this section applies equally to MSTRB cycles 
on the expansion bus. 

Although both the primary bus and the expansion bus may be used to interface 
to a wide variety of devices, the devices most commonly interfaced to these 
buses are memories. Therefore, detailed examples of memory interface are 
presented in this section. 

12.2.1 Zero Wait-State Interface to Static RAMs 

Zero wait-state read access time for the TMS320C3x is determined by the dif- 
ference between the cycle time (specification 1 on page 1 3-21 ) and the sum 
of the times for H1 low to address valid (specification 11 on page 13-23) and 
data setup before next H1 low (specification 1 5.1 on page 1 3-23). For exam- 
ple, for full-speed, zero wait-state interface to any device, the 60-ns 
TMS320C3x requires a read access time of 30 ns from address stable to data 
valid. Because, for most memories, access time from chip select is the same 
as access time from address, it is theoretically possible to use 30-ns memories 
at full speed with the TMS320C3x-33. This, however, dictates that there be no 
delays present between the processor and the memories. This is usually not 
the case in practice, because of interconnection delays and the fact that some 
gating is normally required for chip-select generation. Therefore, slightly faster 
memories are generally required in most systems. 

Among currently available RAMs, there are two distinct categories of devices 
with different interface characteristics: RAMs without output enable control 
lines (OE), which include the 1-bit wide organized RAMs and most of the 4-bit 
wide RAMs, and those with OE controls, which include the byte-wide and a few 
of the 4-bit wide RAMs. Many of the fastest RAMs do not provide OE control; 
they use chip-select (CS) controlled write cycles to insure that data outputs do 
not turn on for write operations. In CS-controlled write cycles, the write control 
line (WE) goes low before CS goes low, and internal logic holds the outputs 
disabled until the cycle is completed. Using CS-controlled write cycles is an ef- 
ficient way to interface fast RAMs without OE controls to the TMS320C30 at 
full speed. 

In the case of RAMs with OE controls, the use of this signal can provide added 
flexibility in many systems. Additionally, many of these devices can be inter- 

■| 2-4 Hardware Applications 



Primary Bus Interface 



faced using CS-controlled write cycles with OE tied low, in the same manner 
as with RAMs without OE controls. There are, however, two requirements for 
interfacing to OE RAMs in this fashion. First, the RAMs OE input must be gated 
with chip select and WE internally so that the device's outputs do not turn on 
unless a read is being performed. Second, the RAM must allow its address in- 
puts to change while WE is low; some RAMs specifically prohibit this. 

Figure 12-3 shows the TMS320C3x interfaced to Cypress Semiconductor's 
CY7C1 86 25-ns J5K x 8-bit CMOS static RAMs with the OE control input tied 
low and using a CS-controlled write cycle. 



Figure 12-3. TMS320C3x Interface to Cypress Semiconductor CY7C186 CMOS SRAM 
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In this circuit, the two chip selects on the RAM a re dr iven by STRB and A23, 
which are ANDed together internally. The use of A23 locat es the RAM at ad- 
dresses OOOOOh through 03FFFh in external memory, and STRB establishes 
the CS-controlled write cycle. The WE control input is then driven by the 



12-5 



Primary Bus Interface 



TMS320C3x R/W signal, and the OE input is not used and is therefore con- 
nected to ground. 

The timing of read operations, shown in Figure 12-4, is very straightforward 
because the two chip select inputs are driven directly. The read access time 
of the circuit is therefore the inverter propagation delay added to the RAMs chip 
select access time, or t-| + 12 = 5 + 25 = 30 ns. This access time therefore meets 
the TMS320C3x-33's specified 30-ns read access time requirement. 



Figure 12-4. Read Operations Timing 
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During write operations, as shown in Figure 12-5, the RAM's outputs do not 
turn on at all, because of the use of the chip-select controlled write cycles. The 
chip-select con trolled write cycles are generated because R/W goes active 
(low) before the STRB term of the chip-select input. Because the RAMs output 
drivers are disabled whenever the WE input is low (regardless of the state of 
the OE input), bus conflicts with the TMS320C3x are automatically avoided 
with this interface. The circuit's data setup and hold times (t-| and t£ in the timing 
diagram) of approximately 50 and 20 ns, respectively, also easily meet the 
RAM's timing requirements of 1 and ns. 
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Figure 12-5. Write Operations Timing 
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If more complex chip-select decode is required than can be accomplished in 
time to meet zero-wait state timing, wait states or bank switching techniques 
(discussed in a later section) should be used. 

Note that the CY7C1 86's OE control is gated internally with CS; therefore, the 
RAM's outputs are not enabled unless the device is selected. This is critical if 
there are any other devices connected to the same bus; if there are no other 
devices connected to the bus, then OE need not be gated internally with chip 
select. 

RAMs without OE controls can also be easily interfaced to the TMS320C3x by 
using a similar approach to that used with RAMs with OE controls. If only one 
bank of memoryjs imp'emented, and no other device s are p resent on the bus, 
the memories' CS input can usually be connected to STRB directly. If several 
devices must b e select ed, however, agate is generally required to AND the de- 
vice select and STRB to drive the CS input to generate the chip-select con- 
trolled write cycles. In either case, the WE input is driven by the TMS320C3x 
R/W signal. Provided sufficiently fast gating is used, 25-ns RAMs may still be 
used. 

As with the case of RAMs with OE control lines, this approach works well if only 
a few banks of memory are implemented where the chip-select decode can be 
accomplished with only one level of gating. If many banks are required to imple- 
ment very large memory spaces, bank switching can be used to provide for 
multiple bank select generation while still maintaining full-speed accesses 
within each bank. Bank switching is discussed in detail in a later section. 
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12.2.2 Ready Generation 



The use of wait states can greatly increase system flexibility and reduce hard- 
ware requirements over systems without wait-state capability. The 
TMS320C3x has the capability of generating wait states on either the primary 
bus or the expansion bus, and both buses have independent sets of ready con- 
trol logic.This subsection discusses ready generation from the perspective of 
the primary bus interface; however, wait-state operation on the expansion bus 
is similar to that of the primary bus. Therefore, these discussions pertain equal- 
ly well to expansion bus operation. Accordingly, ready generation is not in- 
cluded in the specific discussions of the expansion bus interface. 

Wait states are generated on the basis of 

Qi the internal wait-state generator, 



□ the external ready input (RDY), or 

□ the logical AND or OR of the two. 

When enabled, internally generated wait states effect all external cycles, re- 
gardless of the address accessed. If different numbe rs of wait states are re- 
quired for various external devices, the external RDY input may be used to tai- 
lor wait-state generation to specific system requirements. 

If the logical AND (electrical OR) of the wait count and external ready signals 
is selected, the later of the two signals will control the internal ready signal, and 
both signals must occur. Accordingly, external ready control must be implem- 
ented for each wait-state device, and the wait count ready signal must be en- 
abled. 

If the logical OR (or electrical AND, since the signals are low true) of the exter- 
nal and internal wait-count ready signals is selected, the earlier of the two sig- 
nals will generate a ready condition and allow the cycle to be completed. It is 
not required that both signals be present. 

ORing of the Ready Signals 

The OR of the two ready signals can be used to implement wait states for de- 
vices that require a greater number of wait states than are implemented with 
external logic (up to seven). This feature is useful, for example, if a system con- 
tains some fast and some slow devices. In this case, fast devices can generate 
a ready signal externally with a minimum of logic, and slow devices can use 
the internal wait counter for larger numbers of wait states. Thus, when fast de- 
vices are accessed, the external hardware responds promptly with a ready sig- 
nal that terminates the cycle. When slow devices are accessed, the external 
hardware does not respond, and the cycle is appropriately terminated after the 
internal wait count. 

The OR of the two ready signals may also be used if conditions occur that re- 
quire termination of bus cycles prior to the number of wait states implemented 
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with external logic. In this case, a shorter wait count is specified internally than 
the number of wait states implemented with the external ready logic, and the 
bus cycle is terminated after the wait count. This feature may also be used as 
a safeguard against inadvertent accesses to nonexistent memory that would 
never respond with ready and would therefore lock up the TMS320C3X. 

If the OR of the two ready signals is used, however, and the internal wait-state 
count is less than the number of wait states implemented externally, the exter- 
nal ready generation logic must have the ability to reset its sequencing to allow 
a new cycle to begin immediately following the end of the internal wait count. 
This requires that, under these conditions, consecutive cycles must be from 
independently decoded areas of memory and that the external ready genera- 
tion logic be capable of restarting its sequence as soon as a new cycle begins. 
Otherwise, the external ready generation logic may lose synchronization with 
bus cycles and therefore generate improperly timed wait states. 

ANDing of the Ready Signals 

The AND of the two ready signals can be used to implement wait states for de- 
vices that are equipped to provide a ready signal but cannot respond quickly 
enough to meet the TMS320C3x's timing requirements. In particular, if these 
devices normally indicate a ready condition and, when accessed, respond with 
a wait until they become ready, the logical AND of the two ready signals can 
be used to save hardware in the system. In this case, the internal wait counter 
can provide wait states initially and becomes ready after the external device 
has had time to send a not ready indication. The internal wait counter then re- 
mains ready until the external device also becomes ready, which terminates 
the cycle. 

Additionally, the AND of the two ready signals may be used for extending the 
number of wait states for devices that already have external ready logic im- 
plemented but require additional wait states under certain unique circum- 
stances. 

External Ready Generation 

In the implementation of external ready generation hardware, the particular 
technique employed depends heavily on the specific characteristics of the sys- 
tem. The optimum approach to ready generation varies, depending on the rela- 
tive number of wait-state and non-wait-state devices in the system and on the 
maximum number of wait states required for any one device. The approaches 
discussed here are intended to be general enough for most applications and 
are easily modifiable to comprehend many different system configurations. 

In general, ready generation involves the following three functions: 

1 ) Segmentation of the address space in some fashion to distinguish fast and 
slow devices. 
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2) Generating properly timed ready indications. 

3) Logically ORing all of the separate ready timing signals together to connect 
to the physical ready input. 

Segmentation of the address space is required to obtain a unique indication 
of each particular area within the address space that requires wait states. This 
segmentation is commonly implemented in a system in the form of chip-select 
generation. Chip-select signals may be used to initiate wait states in many 
cases; however, occasionally, chip-select decoding considerations may pro- 
vide signals that will not allow ready input timing requirements to be met. In this 
case, coarse address space segmentation may be made on the basis of a 
small number of address lines, where simpler gating allows signals to be gen- 
erated more quickly. In either case, the signal indicating that a particular area 
of memory is being addressed is normally used to initiate a ready or wait-state 
indication. 

Once the region of address space being accessed has been established, a tim- 
ing circuit of some sort is normally used to provide a ready indication to the pro- 
cessor at the appropriate point in the cycle to satisfy each device's unique re- 
quirements. 

Finally, since indications of ready status from multiple devices are typ ically 
present, the signals are logically ORed by using a single gate to drive the RDY 
input. 

Ready Control Logic 

One of two basic approaches can be taken in the implementation of ready con- 
trol lo gic, depending upon the state of the ready input between accesses. If 
RDY is low betwe en acc esses, the processor is always ready unless a wait 
state is required; if RDY is high between accesses, the processor will always 
enter a wait state unless a ready indication is generated. 



If RDY is low between accesses, control of full-speed devices is straightfor- 
ward; no action is necessary because ready is always active unless otherwise 
required. Devices requiring wait states, however, must drive ready high fast 
enough to meet the input timing requirements. Then, after an appropriate 
delay, a ready indication must be generated. This can be quite difficult in many 
circumstances because wait-state devices are inherently slow and often re- 
quire complex select decoding. 



If RDY is high between accesses, zero wait-state devices, which tend to be in- 
herently fast, can usually respond immediately with a ready indication. Wait- 
state devices may simply delay their select signals appropriately to generate 
a ready. Typically, this approach results in the most efficient implementation 
of ready control logic. Figure 12-6 shows a circuit of this type, which can be 
used to generate 0, 1 , or 2 wait states for multiple devices in a system. 
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Figure 12-6. Circuit for Generation of 0, 1, or 2 Wait States for Multiple Devices 
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Example Circuit 

In this circuit, full-speed devices drive ready directly through the 74AS21 , and 
the two flip-flops delay wait-state devices' select signals one or two H1 cycles 
to provide 1 or 2 wait states. 

Considering the TMS320C3x-33's ready delay time of 8 ns following address, 
zero wait-state devices must use ungated address lines directly to drive the in- 
put of the 74AS21 , since this gate contributes a maximum propagation delay 
of 6 ns to the RDY signal. Thus, zero wait-state devices should be grouped to- 
gether within a coarse segmentation of address space if other devices in the 
system require wait states. 

With this circuit, devices requiring wait states may take up to 36 ns from a valid 
address on the TMS320C3x to provide inputs to the 74AS20's inputs. Typical- 
ly, this allows sufficient time for any decoding required in generating select sig- 
nals for slow er dev ices in the system. For example, the 74ALS138, driven by 
address and STRB, can generate select decodes in 22 ns, which easily meets 
the TMS320C3x-33's timing requirements. 
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With this circuit, unused inputs to either the 74AS20s or the 74AS21 should be 
tied to a logic high level to prevent noise from generating spurious wait states. 

If more than 2 wait states are required by devices within a system, other ap- 
proaches may be employed for ready generation. If between three and seven 
wait states are required, additional flip-flops may be included in the same man- 
ner shown in Figure 12-6, or internally generated wait states may be used in 
conjunction with external hardware. If more than seven wait states are re- 
quired, an external circuit using a counter may be used to supplement the ca- 
pabilities of the internal wait-state generators. 

12.2.3 Bank Switching Techniques 

The TMS320C3x's programmable bank switching feature can greatly ease 
system design when large amounts of memory are required. Because, in gen- 
eral, devices take longer to release the bus than they take to drive the bus, 
bank switching is used to provide a period of time for disabling all device selects 
that would not normally be. present otherwise (refer to Section 7.4 for further 
information regarding bank switching). During this interval, slow devices are 
allowed time to turn off before other devices have the opportunity to drive the 
data bus, thus avoiding bus contention. 

When bank switching is enabled, any time a portion of the high order a ddress 
lines changes, as defined by the content s of th e BNKCMPR register, STRB 
goes high for one full H1 cycle. Provided STRB is included in chip-select de- 
codes, this causes all devices t o be di sabled during this period. The next bank 
of devices is not enabled until STRB goes low again. 

In general, bank switching is not required during writes, because these cycles 
alw ays exh ibit an inherent one-half H1 cycle setup of address information be- 
fore STRB goes low. Thus, when you use bank switching for read/write de- 
vices, a minimum of half of one H1 cycle of address setup is provided for all 
accesses. Therefore, large amounts of memory can be implemented without 
wait states or extra hardware required for isolation between banks. Also, note 
that access time for cycles during bank switching is the same as that of cycles 
without bank switching, and, accordingly, full-speed accesses may still be ac- 
complished within each bank. 

When you use bank switching to implement large multiple-bank memory sys- 
tems, an important consideration is address line fanout. Besides parametric 
specifications for which account must be made, AC characteristics are also 
crucial in memory system design. With large memory arrays, which commonly 
require large numbers of address line inputs to be driven in parallel, capacitive 
loading of address outputs is often quite large. Because all TMS320C3x timing 
specifications are guaranteed up to a capacitive load of 80 pF, driving greater 
loads will invalidate guaranteed AC characteristics. Therefore, it is often nec- 
essary to provide buffering for address lines when driving large memory ar- 
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rays. AC timings for buffer performance may then be derated according to 
manufacturer specifications to accommodate a wide variety of memory array 
sizes. 

The circuit shown in Figure 1 2-7 illustrates the use of bank switching with Cy- 
press Semiconductor's 'CY7C1 85 25-ns 8K x 8 CMOS static RAM. This circuit 
implements 32K 32-bit words of memory with one wait-state accesses within 
each bank. 



Figure 12-7. Bank Switching for Cypress Semiconductor's C Y7C185 



BANKSELO - 
BSTRB 
BR/W ■ 



+ 15V 



BA11 



BANKSEL 



BSTRB 



*~ 



A12 V CC 

A11 

A10 

A9 

A8 

A7 

A6 

A5 

A4 

A3 

A2 

A1 

AO 

CS1 
CS2 
WE 
OE_ 



Bank 2 



BANKSEL3 - 



Bank 3 



D31-D0" 



+ 15V 



BA11 



BA5 



BA1 



BAO 



BANKSEL 



BSTRB 



V~ 



A12 V CC 

A11 

A10 

A9 

A8 

A7 

A6 

A5 

A4 

A3 

A2 

A1 

AO 

csT 

CS2 

WE 
OE„ 



BankO 



BA11 



BA8 



BANKSEL 



BSTRB 



V~ 



A12 V CC 
A11 



A10 

A9 

A8 

A7 

A6 

A5 

A4 

A3 

A2 

A1 

AO 

csT 

CS2 
WE 
OE_ 



Data Bus D31-D0 



BA10 



BA5 



BA4 



BA3 



BANKSEL 



BSTRB 



f~ 



A12 V CC 
A11 
'A10 
A9 
A8 
A7 
A6 
A5 
A4 
A3 
A2 
A1 
AO 

CST 
CS2 

WE 
OE 



GND 



A wait state is required with this implementation of bank memory because of 
the added propagation delay presented by the address bus buffers used in the 
circuit. The wait state is not a function of the memory organization of multiple 
banks or the use of bank switching. When bank switching is used, memory ac- 
cess speeds are the same as without bank switching, once bank boundaries 
are crossed. Therefore, no speed penalty is paid when bank switching is used, 
except for the occasional extra cycle inserted when bank boundaries are 
crossed. Note, however, that if the extra cycle inserted when bank boundaries 
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are crossed does impact software performance significantly, code can often 
be restructured to minimize bank boundary crossings, thereby reducing the ef- 
fect of these boundary crossings on software performance. 

The wait state for this bank memory is generated by using the wait-state gener- 
ator circuit presented in the previous section. Because A23 is the signal which 
enables the entire bank memory system, the inverted version of this signal is 
ANDed with STRB to derive a one wait-state device select. This signal is then 
connected in the circuit along with the other one-wait-state device selects. 
Thus, any time a bank memory access is made, one wait state is generated. 

Each of the four banks in this circuit is selected by using a decode of A1 5-A1 3 
generated by the 74AS138 (see Figure 12-8). With the BNKCMPR register 
set to OBh, the banks will be selected on even 8K-word boundaries starting at 
location 080A000h in external memory space. 
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Figure 12-8. Bank Memory Control Logic 
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The 74ALS2541 buffers used on the address lines are necessary in this design 
because the total capacitive load presented to each address line is a maximum 
of 20 x 5 pF or 1 00 pF (bank memory plus zero wait-state static RAM), which 
exceeds the TMS320C3x rated capacitive loading of 80 pF. Using the man- 
ufacturers derating curves for these devices at a load of 80 pF (the load pres- 
ented by the bank memory) predicts propagation delays at the output of the 
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buffers of a maximum of 1 6 ns. The access time of a read cycle within a bank 
of the memory is therefore the sum of the memory access time and the maxi- 
mum buffer propagation delay, or 25 + 1 6 = 41 ns, which, since it falls between 
30 and 90 ns, requires one wait state on the TMS320C3x-33. 

The 74ALS2541 buffers offer one additional system performance enhance- 
ment in that they include 25-ohm resistors in series with each individual buffer 
output. These resistors greatly improve the transient response characteristics 
of the buffers, especially when driving CMOS loads such as the memories used 
here. The effect of these resistors is to reduce overshoot and ringing, which 
is common when driving predominantly capacitive loads such as CMOS. The 
result of this is reduced noise and increased immunity to latchup in the circuit, 
which in turn results in a more reliable memory system. Having these resistors 
included in the buffers eliminates the need to put discrete resistors in the sys- 
tem, which is often required in high-speed memory systems. 

This circuit cannot be implemented without bank switching, because data out- 
put's turn-on and turn-off delays cause bus conflicts. Here, the propagation 
delay of the 74AS1 38 is involved only during bank switches, where there is suf- 
ficient time between cycles to allow new chip selects to be decoded. 

The timing of this circuit for read operations using bank switching is shown in 
Figure 1 2-9. With the BNKCMPR register set to OBh, when a bank switch oc- 
curs, the bank add ress o n address lines A23 — A1 3 is updated during the ex- 
tra H1 cycle while STRB is high. Then, after chip-select deco des ha ve stabi- 
lized and the previously selected bank has disabled its outputs, STRB goes low 
for the next read cycle. Further accesses occur at normal bus timings with one 
wait state, as long as another bank switch is not necessary. Write cycles do not 
require bank switching due to the inherent address setup provided in their tim- 
ings. 
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Figure 12-9. Timing for Read Operations Using Bank Switching 
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This timing is summarized in Table 12-1, 



Table 12-1. Bank Switching Interface Timing 



Time 
interval 


Event 


Timet 
Period 


tl 


H1 falling to address valid/STRB rising 


14 ns 


t2 


Address valid to select delay 


10 ns 


t3 


Memory disable from STRB 


10 ns 


U 


H1 falling to STRB 


10 ns 


t5 


STRB to select delay 


4.5 ns 


t 6 


Memory output enable delay 


3 ns 



t Timing for the TMS320C3X-33. 
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12.3 Expansion Bus Interface 



The TMS320C30's expansion bus interface provides a second complete paral- 
lel bus, which can be used to implement data transfers concurrently with, and 
independent of, operations on the primary bus. The expansi on b us compr ises 
two mutually exclusive interfaces controlled by the MSTRB and IOSTRB sig- 
nals, respectiv ely. Thi s subsec tion discusses interface to the expansion bus 
using IOSTRB cycles; MSTRB cycles are essentially equivalent in timing to pri- 
mary bus cycles and are discussed in Section 12.2. This section applies to 
TMS320C30 devices. 

Unlike the primary bus, both read and write cycles on the I/O portion of the ex- 
pansion bus are two H1 cycles in duration and exhibit the same timing. The 
XR/W signal is high for reads and low for writes. Since I/O accesses take two 
cycles, many periphe rals that require wait states if interfaced either to the pri- 
mary bus or by using MSTRB, may be used in a system without the need for 
wait states. Specifically, in cases where there is only one device on the expan- 
sion bus, devices with address access times greater than the 30 ns required 
by the primary bus, but less than 59 ns, can be interfaced to the I/O bus of the 
TMS320C30-33 without wait states. 

A/D Converter Interface 

A/D and D/A converters are components that are commonly required in DSP 
systems and interface efficiently to the I/O expansion bus. These devices are 
available in many speed ranges and with a variety of features. While some may 
require one or more wait states on the I/O bus, others may be used at full 
speed. 

Figure 12-10 illustrates a TMS320C30 interface to an Analog Devices 
AD1678 analog-to-digital converter. The AD1678 is a 12-bit, 5-jis converter 
that allows sample rates up to 200 kHz and has an input voltage range of 1 
volts bipolar or unipolar. The converter is connected according to manufactur- 
er's specifications to provide 0- to +1 0-volt operation. This interface illustrates 
a common approach to connecting devices such as this to the TMS320C30. 
Note that the interface requires only a minimum amount of control logic. 
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Figure 12-10. Interface to AD 1678 A/D Converter 
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The AD1 678 is a very flexible converter and is configurable in a number of dif- 
ferent operating modes. These operating modes include byte or word data for- 
mat, continuous or noncontinuous conversions, enabled or disabled chip-se- 
lect function, and programmable end of conversion indication. This interface 
utilizes 1 2-bit word data format, rather than byte format to be compatible with 
the TMS320C3x. Noncontinuous conversions are selected so that variable 
sample rates may be used because continuous conversions occur only at a 
rate of 200 kHz. With noncontinuous conversions, the host processor deter- 
mines the conversion rate by initiating conversions through write operations 
to the converter. 
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The chip-select function is enabled, so the chip-select input is required to be 
active when accessing the device. Enabling the chip select function is neces- 
sary to allow a mechanism for the AD1 678 to be isolated from other peripheral 
devices connected to the_expansion bus. To establish the desired o perating 
modes, the SYNC and 1 2/8 inputs to the converter are pulled high and EOCEN 
is grounded, as specified in the AD1678 data sheet. 

In this application, the converter's chip select is driven by XA12, which maps 
this device at 804000h in I/O address space. Conversions are initiated by writ- 
ing any data value to the device, and the conversion results are obtained by 
reading from the device after the conversion is completed. To genera te the de- 
vice's start conversion (SC) and output enable (OE) inputs, IOSTRB is ANDed 
with XR/W. Therefore, the converter is selected whenever XA1 2 is low; OE is 
driven when reads are performed, while SC is driven when writes are per- 
formed. 

As with many A/D converters, at the end of a read cycle the AD1 678 data output 
lines enter a high-impedance state. This occurs after the output enable (OE) 
or read control line goes inactive. Also common with these types of devices is 
that the data output buffers often require a substantial amount of time to actual- 
ly attain a full high-impedance state. When used with the TMS320C30-33, de- 
vices must h ave their outputs fully disabled no later than 65 ns following the 
rising edge of IOSTRB because the TMS320C30 will begin driving the data bus 
at this point if the next cycle is a write. If this timing is not met, bus conflicts be- 
tween the TMS320C30 and the AD1678 may occur, potentially causing de- 
graded system performance and even failure due to damaged data bus driv- 
ers. The actual disable time for the AD1 678 can be as long as 80 ns; therefore, 
buffers are required to isolate the converter outputs from the TMS320C30. The 
buffers used here are 74LS244s tha t are enab led when the AD1 678 is read and 
turned off 30.8 ns following IOSTRB going high. Therefore, the 
TMS320C30-33 requirement of 65 ns is met. 

When data is read following a conversion, the AD1678 takes 100 ns after its 
OE control line is asserted to provide valid data at its outputs. Thus, including 
the propagation delay of the 74LS244 buffers, the total access time for reading 
the converter is 118 ns. This requires two wait states on the TMS320C30-33 
expansion I/O bus. 

The two wait states required in this case are implemented using software wait 
states; however, depending on the overall system configuration, it may be nec- 
essary to implement a separate wait-state generator for the expansion bus (re- 
fer to section on ready generation). This would be the case if there were multi- 
ple devices that required different numbers of wait states connected to the ex- 
pansion bus. 

Figure 12-11 shows the timing for read operations between the 
TMS320C30-33 and the AD1678. At the beginning of the cycle, the address 
and XR/W lines become valid t-j = 10 ns following the falling edge of H-j . Then, 
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after t 2 = 10 ns from the next rising edge of H-| , IOSTRB goes low, beginning 
the active port ion o f the read cycle. After t3 = 5.8 nsjhe control logic propaga- 
tion delay, the IOR signal goes low, asserting the OE input to the AD1 678. The 
74LS244 buffers take fy = 30 ns to enable their outputs, and then, following 
the converters access delay and the buffer propagation delay (ts = 100 + 18 
= 118 ns), data is provided to the TMS320C 30. This p rovides approximately 
46 ns of data setup before the rising edge of IOSTRB. Therefore, this design 
easily satisfies the TMS320C30-33's requirement of 15 ns of data setup time 
for reads. 

Figure 12-11. Read Operations Timing Between the TMS320C30 andAD1678 
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Unlike the primary bus, read and write cycles on the I/O expansion bus are 
timed the same with the exception that XR/W is high for reads and lowfor writes 
and that the data bus is driven by the TMS320C30 during writes. When writing 
to the AD1 678, the 74LS244 buffers do not turn on and no data is transferred. 
The purpose of writing to the converter is only to generate a pulse on the con- 
verter's SC input, which initiates a conversion cycle. When a conversion cycle 
is completed, the AD1 678's EOC output is used to generate an interrupt on the 
TMS320C30 to indicate that the converted data may be read. 

It should be noted that for different applications, use of TLC1 225 or TLC1 550 
A/D converters from Texas Instruments may be beneficial. The TLC1225 is a 
self-calibrating 12-bit-plus-sign bipolar or unipolar converter, which features 
1 0-us conversion times. The TLC1 550 is a 1 0-bit, 6-jis converter with a high- 
speed DSP interface. Both converters are parallel-interface devices. 

D/A Converter Interface 

In many DSP systems, the requirement for generating an analog output signal 
is a natural consequence of sampling an analog waveform with an A/D conver- 
ter and then processing the signal digitally internally. Interfacing D/A conver- 
ters to the TMS320C30 on the expansion I/O bus is also quite straightforward. 



12-21 



Expansion Bus Interface 



As with A/D converters, D/A converters are also available in a number of vari- 
eties. One of the major distinctions between various types of D/A converters 
is whether or not the converter includes both latches to store the digital value 
to be converted to an analog quantity, and the interface to control those latches. 
With latches and control logic included with the converter, interface design is 
often simplified; however, internal latches are often included only in slower D/A 
converters. 

Because slower converters limit signal bandwidths, the converter used in this 
design was selected to allow a reasonably wide range of signal frequencies 
to be processed, and to illustrate the technique of interfacing to a converter that 
uses external data latches. 

Figure 1 2-1 2 shows an interface to an Analog Devices AD565A digital-to-ana- 
log converter. This device is a 12-bit, 250-ns current output DAC with an on- 
chip 10-volt reference. Using an offchip current-to-voltage conversion circuit 
connected according to manufacturers specifications, the converter exhibits 
output signal ranges of to +1 volts, which is compatible with the conversion 
range of the A/D converter discussed in the previous section. 
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Figure 12-12. Interface Between the TMS320C30 and the AD565A 
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Because this DAC essentially performs continuous conversions based on the 
digital value provided at its inputs, periodic sampling is maintained by periodi- 
cally updating the value stored in the external latches. Therefore, between 
sample updates, the digital value is stored and maintained at the latch outputs 
that provide the input to the DAC. This results in the analog output remaining 
stable until the next sample update is performed. 

The external data latches used in this interface are 74LS377 devices that have 
both clock and enable inputs. These latches serve as a convenient interface 
with the TMS320C30; the enable inputs provide a device select function, and 
the clock inputs latch the data. Ther efore , with the enable input driven b y in- 
verted XA12 and the clock input by IOW, which is the AND of IOSTRB and 
XR/W, data will be stored in the latches when a write is performed to I/O ad- 
dress 805000h. Reading this address has no effect on the circuit. 
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Figure 1 2-1 3 shows a timing diagram of a write operation to the D/A converter 
latches. 

Figure 12-13. Write Operation to the D/A Converter Timing Diagram 
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Because the write is actually being performed to the latches, the key timings 
for this operation are the timing requirements for these devices. For proper op- 
eration, these latches require simply a minimal setup and hold time of data and 
control signals with respect to the rising edge of the clock input. Specifically, 
the latches require a data setup time of 20 ns, enable setup of 25 ns, disable 
setup of 10 ns, and data and enable hold times of 5 ns. This design provides 
approximately 60 ns of enable setup, 30 ns of data setup, and 7.2 ns of data 
hold time. Therefore, the setup and hold times provided by this design are well 
in excess of those required by the latches. The key timing parameters for this 
interface are summarized in Table 12-2. 



Table 12-2. Key Timing Parameter for D/A Converter Write Operation 



Time 
Interval 


Event 


Timet 
Period 


tl 


H1 falling to address valid 


10 ns 


t2 


XA12toXA12delay 


5 ns 


t3 


H1 rising to IOSTRB falling 


10 ns 


U 


IOSTRB to IOW delay 


5.8 ns 


t5 


Data setup to IOW 


30 ns 


t6 


Data hold from IOW 


7.2 ns 



t Timing for the TMS320C30-33. 
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12.4 System Control Functions 



Several aspects of TMS320C3x system hardware design are critical to overall 
system operation. These include such functions as clock and reset signal gen- 
eration and interrupt control. 



12.4.1 Clock Oscillator Circuitry 



You may provide an input clock to the TMS320C3x either from an external 
clock input or by using the onboard oscillator. Unless special clock require- 
ments exist, the onboard oscillator is generally a convenient method for clock 
generation. This method requires few external components and can provide 
stable, reliable clock generation for the device. 

Figure 1 2-1 4 shows the external clock generator circuit designed to operate 
at 33.33 MHz and to use the internal oscillator circuitry of the TMS320C3x. 
Since crystals with fundamental oscillation frequencies of 30 MHz and above 
are not readily available, a parallel-resonant third-overtone circuit is used. 



Figure 12-14. Crystal Oscillator Circuit 
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In a third-overtone oscillator, the crystal fundamental frequency must be atte- 
nuated so that oscillation is at the third harmonic. This is achieved with an LC 
circuit that filters out the fundamental, thus allowing oscillation at the third har- 
monic. The impedance of the LC circuit must be inductive at the crystal funda- 
mental and capacitive at the third harmonic. The impedance of the LC circuit 
is represented by 



z(co) = 



'{<» l-^t] 



(3) 



Therefore, the LC circuit has a pole at 

1 



C0 r 



'LC 



(4) 
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At frequencies significantly lower than co p , the 1 /(coC) term in (3) becomes the 
dominating term, while coL can be neglected. This is expressed as 



z(co) = jcoL 



for co < cor 



(5) 



In (5), the LC circuit appears inductive at frequencies lower than co p . On the 
other hand, at frequencies much higher than co p , the coL term is the dominant 
term in (3), and 1/(coC) can be neglected. This is expressed as 



1 



z(co) = j coC 



for co > co r 



(6) 



The LC circuit in (6) appears increasingly capacitive as the frequency 
increases above co p . This is shown in Figure 1 2-1 5, which is a plot of the mag- 
nitude of the impedance of the LC circuit of Figure 12-14 versus frequency. 



Figure 12-15. Magnitude of the Impedance of the Oscillator LC Network 
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Based on the discussion above, the design of the LC circuit proceeds as fol- 
lows: 

1 ) Choose the pole frequency co p approximately halfway between the crystal 
fundamental and the third harmonic. 

2) The circuit now appears inductive at the fundamental frequency and ca- 
pacitive at the third harmonic. 

In the oscillator of Figure 12-13, choose co p = 22.2 MHz, which is approxi- 
mately halfway between the fundamental and the third harmonic. Choose 
C = 20 pF. Then, using equation (4), L = 2.6 uH. 



12.4.2 Reset Signal Generation 



The reset input controls initialization of internal TMS320C3x logic and also 
causes execution of the system initialization software. For proper system ini- 
tialization, the reset signal must be applied at least ten H1 cycles, i.e., 600 ns 
for a TMS320C3x operating at 33.33 MHz. Upon powerup, however, it can take 
20 ms or more before the system oscillator reaches a stable operating state. 
Therefore, the powerup reset circuit should generate a low pulse on the reset 
line for 1 00 to 200 ms. Once a proper reset pulse has been applied, the proces- 
sor fetches the reset vector from location zero, which contains the address of 
the system initialization routine. Figure 12-1 6 shows a circuit that will generate 
an appropriate powerup reset circuit. 



Figure 12-16. Reset Circuit 
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The voltage on the reset pin (RESET) is controlled by the R-| C-j network. After 
a reset, this voltage rises exponentially according to the time constant R-|Ci, 
as shown in Figure 12-17. 



Figure 12-17. Voltage on the TMS320C30 Reset Pin 
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The duration of the low pulse on the reset pin is approximately ti , which is the 
time it takes for the capacitor C-j to be charged to 1 .5 V. This is approximately 
the voltage at which the reset input switches from a logic to a logic 1 . The ca- 
pacitor voltage is expressed as 



V = V 



cc 



1-e 



t 



(7) 



where x = R-j C-j is the reset circuit time constant. Solving equation (7) for t re- 
sults in 



t = -R 1 C 1 1n 



1- 



V 



V, 



cc 



(8) 



Setting the following: 

Rl =100KQ 

Ci = 4.7 uF 

V C c = 5 V 

V = V 1 = 1.5 V 

results in t = 167 ms. Therefore, the reset circuit of Figure 12-16 provides a 
low pulse of long enough duration to ensure the stabilization of the system os- 
cillator. 
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Note that if synchronization of multiple TMS320C3xs is required, all proces- 
sors should be provided with the same input clock and the same reset signal. 
After powerup, when the clock has stabilized, all processors may then be syn- 
chronized by generating a falling edge on the common reset signal. Because 
it is the falling edge of reset that establishes synchronization, reset must be 
high for at least ten H1 cycles initially. Following the falling edge, reset should 
remain low for at least ten H1 cycles and then be driven high. This sequencing 
of reset may be accomplished using additional circuitry based on either RC 
time delays or counters. 
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12.5 Serial Port Interface 



For applications such as modems, speech, control, instrumentation, and ana- 
log interface for DSPs, a complete analog-to-digital (A/D) and digital-to-analog 
(D/A) input/output system on a single chip may be desired. The TLC32044 
analog interface circuit (AIC) integrates a bandpass, switched-capacitor, anti- 
aliasing-input filter, 14-bit resolution A/D and D/A converters, and a lowpass, 
switched-capacitor, output-reconstruction filter, all on a single monolithic/ 
CMOS chip. The TLC32044 offers numerous combinations of master clock in- 
put frequencies and conversion/sampling rates, which can be changed via dig- 
ital signal processor control. 

Four serial port modes on the TLC32044 allow direct interface to TMS320C3x 
processors. When the transmit and receive sections of the AIC are operating 
synchronously, it can interface to two SN54299 or SN74299 serial-to-parallel 
shift registers. These shift registers can then interface in parallel to the 
TMS320C30, to other TMS320 digital processors, or to external FIFO circuitry. 
Output data pulses inform the processor that data transmission is complete or 
allow the DSP to differentiate between two transmitted bytes. A flexible control 
scheme is provided so that the functions of the AIC can be selected and ad- 
justed coincidentally with signal processing via software control. Refer to the 
TLC32044 data sheet for detailed information. 

When you interface the AIC to the TMS320C3x via one of the serial ports, no 
additional logic is required. This interface is shown in Figure 1 2-1 8. The serial 
data, control, and clock signals connect directly between the two devices, and 
the AlC's master clock input is driven fro m TCL KO, one of the TMS320C3x's 
internal timer outputs. The AlC's WORD/BYTE input is pulled high, selecting 
16-bit serial port transfers to optimize serial port data transfer rate. The 
TMS3 20C3x's XFO pin, configured as an output, is connected to the AlC's reset 
(RST) input to allowthe AIC to be reset by the TMS320C3x under program con- 
trol. This allows the TMS320C3x timer and serial port to be initialized before 
beginning conversions on the AIC. 
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Figure 12-18. AIC to TMS320C30 Interface 
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To provide the master clock input for the AIC, the TCLKO timer is configured 
to generate a clock signal with a 50% duty cycle at a frequency of f(H1)/4 or 
4.1 67 MHz. To accomplish this, the global control register for timer is set to 
the value 3C1h, which establishes the desired operating modes. The period 
register for timer is set to 1 , which sets the required division ratio for the H1 
clock. 

To properly communicate with the AIC, the TMS320C30 serial port must be 
configured appropriately. To configure the serial port, several TMS320C30 
registers and memory locations must be initialized. First, the serial port should 
be reset by setting the serial port global control register to 21 70300h. (The AIC 
should also be reset at this time. See description below of resetting the AIC via 
XFO). This resets the serial port logic and configures the serial port operating 
modes, including data transfer lengths, and enables the serial port interrupts. 
This also configures another important aspect of serial port operation: polarity 
of serial port signals. Because active polarity of all serial port signals is pro- 
grammable, it is critical to set appropriately the bits in the serial port global con- 
trol register that control this. In this application, all polarities are set to positive 
except FSX and FSR, which are driven by the AIC and are true low. 

The serial port transmit and receive control registers must also be initialized 
for proper serial port operation. In this application, both of these registers are 
set to 1 1 1 h, which configures all of the serial port pins in the serial port mode, 
rather than the general-purpose digital I/O mode. 

When the operations described above completed, interrupts are enabled, and 
provided that the serial port interrupt vector(s) are properly loaded, serial port 
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transfers may begin after the serial port is taken out of reset. This is accom- 
plished by loading E170300h into the serial port global controlregister. 

To begin conversion operations on the AIC and subsequent transfers of data 
on the serial port, the AIC is first reset by setting XFO to zero at the beginning 
of the TMS320C3x initialization routine. Setting XFO to zero is accomplished 
by setting the TMS320C3x IOF register to 2. This sets the AIC to a default con- 
figuration and halts serial port transfers and conversion operations until reset 
is set high. Once the TMS320C3x serial port and timer have been initialized 
as described above, XFO is set high by setting the IOF register to 6. This allows 
the AIC to begin operating in its default configuration, which in this application 
is the desired mode. In this mode, all internal filtering is enabled, sample rate 
is set at approximately 6.4 kHz, and the transmit and receive sections of the 
device are configured to operate synchronously. Conveniently, this mode of 
operation is appropriate for a variety of applications, and if a 5. 1 84-MHz master 
clock input is used, the default configuration results in an 8-kHz sample rate, 
which makes this device ideal for speech and telecommunications applica- 
tions. 

In addition to the benefit of a convenient default operating configuration, the 
AIC can also be programmed for a wide variety of other operating configura- 
tions. Sample rates and filter characteristics may be varied, in addition to 
which, numerous connections in the device may be configured to establish dif- 
ferent internal architectures, by enabling or disabling various functional 
blocks. 

To configure the AIC in a fashion different from the default state, the device 
must first be sent a serial data word with the two LSBs set to one. The two LSBs 
of a transmitted data word are not part of the transferred data information and 
are not set to one during normal operation. This condition indicates that the 
next serial transmission will contain secondary control information, not data. 
This information is then used to load various internal registers and specify in- 
ternal configuration options. There are four different types of secondary control 
words distinguished by the state of the two LSBs of the control information 
transferred. Note that each secondary control word transferred must be pre- 
ceded by a data word with the two LSBs set to one. 

The TMS320C3x can communicate with the AIC either synchronously or 
asynchronously, depending on the information in the control register. The oper- 
ating sequence for synchronous communication with the TMS320C30 shown 
in Figure 1 2-1 9 is as follows: 



1 ) The FSX or FSR pin is brought low. 

2) One 16-b it word is transmitted, or one 1 6-bit word is received. 

3) The FSX or F SR pin is brought high. 

4) The EODX or OEDR pin emits a low-going pulse. 
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Figure 12-19. Synchronous Timing of TLC32044 to TMS320C3x 
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For async hronous communication, the operating sequence is similar, but FSX 
and FSR do not occur at the same time (see Figure 1 2-20). After each receive 
and transmit operation, the TMS320C30 asserts an internal receive (RINT) 
and transmit (XINT) interrupt, which may be used to control program execu- 
tion. 



Figure 12-20. Asynchronous Timing of TLC32044 to TMS320C30 
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12.6 XDS1000 Target Design Considerations 



The TMS320C3x Emulator is an Extended Development System (XDS500 
and XDS1000) that uses a revolutionary technology to accomplish complete 
emulation via a serial scan path and has all the features necessary for full- 
speed emulation. To perform realtime emulation, you must provide a 12-pin 
header on the target system that is using the TMS320C3x. Refer to the 
TMS320C30 Emulator User's Guide and to the TMS320C30 Hewlett-Packard 
64776 Analysis Subsystem User's Guide tor a more complete description of 
theXDS500andXDS1000. 

To use the emulation connector of the XDS500, supply the signals shown in 
Figure 1 2-21 to a 1 2-pin header (two rows of six pins) with pin 8 cut out to pro- 
vide keying. Table 1 2-3 describes the pins and signals present on the header. 



Figure 12-21. 12-Pin Header Signals 



Header Dimensions: 

Pin-to-pin spacing: 0.1 00 inches (X,Y) 

Pin width: 0.025 inches square post 

Pin length: 0.235 inches nominal 
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t These signals should always be pulled up with separate 20-kfi resistors to +5 V on the 
TMS320C3X. 



Table 12-3. Signal Description 



Signal 


Description 


TMS320C30 
Pin Number 


TMS320C30 
Pin Number 


EMU0 


Emulation pin 


F14 


124 


EMU1 


Emulation pin 1 


E15 


125 


EMU2 


Emulation pin 2 


F13 


126 


EMU3 


Emulation pin 3 


E14 


123 


GND 


Ground 






H3 


TMS320C3X H3 


A1 


82 


PD 


Presence detect indicates that the cable is connected and the target system 
is powered up. It should be tied to +5 volts in the target system. 
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In addition to the signals required at the emulation connector, the EMU4 
through EMU6 signals on the TMS320C3x must also be appropriately con- 
nected to ensure proper emulation operation. The EMU4 signal should be tied 
through a pull-up resistor to +5 volts, and EMU5 and EMU6 must be left uncon- 
nected. Also, the RSVO through RSV10 signals must be tied through pull-up 
resistors to +5 volts as described in Chapter 13. 

The suggested parts and part numbers for the header are as follows: 

Qi DuPont Electronics straight header, unshrouded. Part number 
67996-112. 

□i DuPont Electronics right angle header, unshrouded. Part number 
68405-112. 

□ Amp right-angle header, four-wall shrouded. Part number 1 031 67-3. 

Figure 1 2-22 is a diagram of the typical setup for using the emulation connec- 
tion of the XDS1 000. 



Figure 12-22. Typical Setup for Using the Emulation Connection of the XDS1000 
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For unbuffered signals, the distance between the TMS320C30 emulation pins 
(EMUO, EMU1 , EMU2, EMU3, and H3) and the 1 2-pin header should be less 
than two inches. If that distance is more than two inches but less than six in- 
ches, the EMU3 and H3 signals should be buffered. The buffer should be non- 
inverting with a worst case propagation delay of 6.0 ns. For emulation pins-to- 
header distances greater than six inches, all emulation signals should be buff- 
ered. Recall that EMUO, EMU1 , and EMU2 are inputs, and EMU3 and H3 are 
outputs. The buffer should have the same characteristics as those given 
above. 
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12.7 Hewlett-Packard 64776 Analysis Subsystem Target Design 
Considerations 

The Hewlett-Packard 64776 analysis subsystem is a hardware product that 
can be used only with the TMS320C30 emulator (XDS500). The analysis sub- 
system captures TMS320C30-33 bus cycle information in real time and can 
react to the captured information with an action such as a hardware breakpoint. 
Refer to the TMS320C30 Hewlett-Packard 64776 Analysis Subsystem User's 
Guide for a more complete description of the Hewlett-Packard 64776 Analysis 
Subsystem. 

This subsection describes electrical information, timing specifications, and 
mechanical dimensions related to the analysis subsystem and the analysis 
subsystem connector pod. When designing your target system, take this infor- 
mation into consideration in order to ensure that the analysis subsystem can 
be used with your target system. 



12.7.1 System Overview 



Figure 1 2-23 illustrates a sample installation of an emulation system that uses 
the analysis subsystem. 

Q The TMS320C30 emulator is installed in a PC. 

Q The emulator target cable's 1 2-pin connector plugs into a header on the 
trace cable pod. 

Q The analysis subsystem communicates with the PC through a serial port; 
an RS-232 cable connects the analysis subsystem to the PC. 

□ The trace cable pod, which contains a TMS320C30-33 processor, inserts 
into the TMS320C30 socket on your target system. 



Figure 12-23. Installation Overview 
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12.7.2 Electrical Information 

12. 7.2. 1 Capacitance of the Electrical Probe 

The following lines will have approximately 25 picofarads of additional capaci- 
tance (1 pF for the FCT541 A inputs, 1 pF for the PC board trace, and 5 pF 
for the processor socket). 



RESE T INT0-INT3 

IACK XFO, XF1 

TCLKO, TCLK1 H1 , H3 clocks 

EMU0-EMU6 A23-A0 

D31-D0 R/W 

HOLDA XA12-XA0 

XD31- XD0 XR/W 

STRB IOSTRB 



MSTRB 



The remaining processor lines have only 5 pF additional capacitance because 
the only electrical connection for each is the socket. 



12.7.2.2 Power-Supply Loading of the Active Probe 

The additional current load is: 



Quantity 


Part 


Typ 


Max 


Typ Total 


Max Total 


Unit 


1 


TMS320C30 


,200 


600 


200 


600 


mA 


1 


16R47-7 


120 


180 


120 


180 


mA 


15 


FCT541A 


21.9 


42.7 


328.8 


640.5 


mA 



648.8 1420.5 mA 

Note: The probe uses 448 m A more than the processor typ and 820 m A more than the processor 
max. 

12.7.3 Timing Information 

The active probe requires that some of the target system signals have slightly 
better timing characteristics than those required by the TMS320C30-33 pro- 
cessor. 

Q Expansion and primary address buses (XA12-XA0, A23-A0). The 

same electrical/timing specifications stated in Chapter 13 apply to these 
lines. 

□ Expansion and primary data buses (XD31-X0, D31-D0). Chapter 1 3 
specifies that the hold time of the data on a read cycle must occur ns after 
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the rising edge of the strobe (IOSTRB, STRB, or MSTRB). Also, the strobe 
can fall and rise ns after the falling and rising edges of the H1 clock, 
respectively. Thus, if the data was not present ns after the rising edge 
of the clock, the TMS320C30 would still operate correctly. The analysis 
subsystem, however, requires a 5-ns hold time after the rising edge of H1 . 

Qi Expansion and primary read/write (XR/W, R/W). The same electrical/ 
timing specifications stated in Chapter 13 apply to these lines. 

□ H1 and H3. The same electrical/timing specifications stated in Chapter 1 3 
apply to these lines. 



RESET. While the analysis subsystem does load the reset line with some 
minor capacitance, it does not drive the line. 

EMU0-EMU6.The same electrical/timing specifications stated in Chapter 
13 apply to these lines. 



INT0-INT3. Requirements for levels and transitions are different. 

■ Requirements for levels on the interrupt lines: 

n The tck_clk=all setup time to falling H1 should be 1 8.5 ns, instead 
of 15 ns, to see the first edge. However, because interrupts must 
be low for a minimum of one full clock cycle, you will always see 
one low cycle. 

■ The tck_clk=bus_cycle setup time to falling H1 should be 15 ns. 

■ Requirements for transitions on the interrupt lines: 

n The tck_clk=bus_cycle setup time to falling H1 should be 15 ns. 

■ The tck_clk=all setup time to falling H1 should be 15 ns. 



IACK. The same electrical/timing specifications stated in Chapter 1 3 apply 
to these lines. 



HOLDA. The same electrical/timing specifications stated in Chapter 1 3 
apply to these lines. 

XFO, XF1, TCLKO, and TCLK1. Requirements for levels and transitions 
are different. 

H Requirements for levels on the XF and TCLK lines: 

■ The tck_clk=all setup time to falling H1 should be 1 8.5 ns instead 
of 12 ns. 

■ The tck_clk=bus_cycle setup time to falling H1 should be 12 ns. 
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■ Requirements for transitions on the XF and TCLK lines: 

■ The tck_clk=bus_cycle setup time to falling H1 should be 12 ns. 

■ The tck_clk=all setup time to falling H1 should be 12 ns. 



□i STRB, IOSTRB, and MSTRB. No special timing is required on the 
strobes. 



12.7.4 Mechanical Dimensions 



Figure 1 2-24 shows the mechanical dimensions for the analysis subsystem's 
connector to the target system. Additional sockets may be added to the pod/ 
connector to increase the height clearance, if desired. 



Figure 12-24. Analysis Subsystem Pod/Connector Dimensions 
(a) Top view 
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Notes: 1) All dimensions are in inches. 

2) The cable is approximately 38 inches long. The portion of the cable that attaches to the pod is not as flexible as the 
rest of the cable, and effectively adds approximately 1/2 inch to 1 inch to the total length of the pod. 



(b) Side view 
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Note: All dimensions are in inches. 
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12.8 TMS320C30 and TMS320C31 Differences 



This section addresses the major memory access differences between the 
TMS320C31 and the TMS320C30 devices. Observance of these consider- 
ations is critical for achieving design goal success. 

Table 12-4 shows these differences, which are detailed in the following sub- 
sections. 



Table 12-4. Feature Set Comparison 



Feature 


Device 


TMS320C31 


TMS320C30 


Data/program bus 


Primary bus: one bus composed 
of a 32-bit data and a 24-bit ad- 
dress bus 


Two buses: 

1 ) Primary bus: a 32-bit data and a 
24-bit address 

2) Expansion bus: a 32-bit data and a 
1 3-bit address 


Serial I/O ports 


1 serial port (SPO) 


2 serial ports (SPO, SP1) 


User program/data ROM 


Not available 


4K words/1 6K bytes 


Program boot loader 


User selectable 


Not available 



12.8.1 Data/Program Bus Differences 



The TMS320C31 uses only the primary bus and reserves the memory space 
that was previously used for expansion bus operations. 



Be careful! Program access to a reserved area produces unpredict- 
able results. 



12.8.2 Serial Port Differences 



Serial port 1 references in Section 8.2 of the TMS320C3x User's Guide are not 
applicable to the TMS320C31 . The memory locations identified for the asso- 
ciated control registers and buffers are reserved. 



12.8.3 Reserved Memory Locations 



Table 12-5 identifies TMS320C31 reserved memory locations in addition to 
those shown in Table 3-8. 
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Table 12-5. TMS320C31 Reserved Memory Locations 



Feature 


Device 


TMS320C31 


TMS320C30 


OxOOOOOO-OxOOOFFF 


Reservedt 


Microcomputer program/data ROM modet 


0x800000-0x801 FFF 


Reserved 


Expansion bus MSTRB space 


0x804000-0x805FFF 


Reserved 


Expansion bus IOSTRB space 


0x808050 


Reserved 


SP1 global-control register 


0x808052-0x808056 


Reserved 


SP1 local-control registers 


0x808058 


Reserved 


SP1 data-transmit buffer 


0x80805C 


Reserved 


SP1 receive-transmit buffer 


0x808060 


Reserved 


Expansion bus control register 



t Applies to the MCBL and MC modes only. 



12.8.4 Effects on the IF and IE Interrupt Registers 



The bits associated with serial port 1 in the IE (interrupt enable) register and 
the IF (interrupt flag) register for the TMS320C30 are not applicable to the 
TMS320C31 . Write only logic data to IE register bits 6, 7, 22, and 23 and to 
IF register bits 6 and 7. Writing logic 1s to these bits produces unpredictable 
results. 



12.8.5 User Program/Data ROM 



The user program/data ROM that is available for the TMS320C30 device does 
not exist for the TMS320C31. Rather, the memory locations that were allocated 
to support user program/data ROM operations have been reserved on the 
TMS320C31 to support microcomputer/boot loader accessing. See Chap- 
ter 3 for more information on using the microcomputer/boot loader function. 



12.8.6 Development Considerations 



For users who are developing application code using aTMS320C3x simulator, 
XDS, or ASM/LNK, Tl recommends that you modify the .dm and .cmd files by 
removing these memory spaces from the tool's configured memory. This en- 
sures that your developed application performs as expected when the 
TMS320C31 device is used. 
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Chapter 13 

TMS320C3x Signal Descriptions and 
Electrical Characteristics 



This chapter covers the TMS320C3x pinouts, signal descriptions, and 
electrical characteristics. 
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13.1 Pinout and Pin Assignments 



13.1.1 TMS320C30 Pinouts and Pin Assignments 



The TMS320C30 digital signal processor is available in a 181 -pin grid array 
(PGA) package. The pinout of this package is shown in Figure 13-1 and 
Figure 13-2. The pin assignments are listed in Table 13-1 and Table 13-2. 



Figure 13-1. TMS320C30 Pinout (Top View) 
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Figure 13-2. TMS320C30 Pinout (Bottom View) 
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• 


9 


9 


© 


© 


© 


© 


XD31 


cvss 


XD29 


XD24 


XD20 


XD17 


XD13 


XD10 


XD8 


XD4 


XD1 


TCLKO 


FSRO 


DX1 


FSX1 
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• 
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XD30 


XD26 


XD25 


XD21 


XD18 


XD15 


XD12 


XD11 


XD9 


XD6 


XD3 


XDO 


DXO 


FSXO 


DRO 


• 


• 


• 


© 


© 


• 


• 


© 


• 


• 


• 


© 


• 


• 


• . 



A 
B 
C 
D 
E 
F 
G 
H 
J 
K 
L 
M 
N 
P 
R 
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Table 13-1. TMS320C30 Pin Assignments(by Function) (Figure 13-1 and Figure 13-2) 



Function 


Pin 




Function 


Pin 




Function 


Pin 




Function 


Pin 




Function 


Pin 


AO 
A1 
A2 
A3 
A4 


F15 
G12 
G13 
G14 
G15 




DO 
D1 
D2 
D3 
D4 


C4 
D5 
A2 
A3 
B4 




FSRO 

FSXO 

CLKRO 

CLKXO 

DRO 


P3 
R2 
N4 
M5 
R1 




XAO 
XA1 
XA2 
XA3 
XA4 


A13 
A14 
D11 
C12 
B13 




XDO 
XD1 
XD2 
XD3 
XD4 


R4 
P5 
N6 
R5 
P6 


A5 
A6 
A7 
A8 
A9 


H15 
H14 
J15 
J14 
J13 




D5 
D6 
D7 
D8 
D9 


C5 
D6 
A4 
B5 
C6 




DXO 

FSR1 

FSX1 

CLKR1 

CLKX1 


R3 
M3 
P1 
L4 
N2 




XA5 
XA6 
XA7 
XA8 
XA9 


A15 
B15 
C14 
E12 
D13 




XD5 
XD6 
XD7 
XD8 
XD9 


M7 
R6 
N7 
P7 
R7 


A10 
A11 
A12 
A13 
A14 


K15 
J12 
K14 
L15 
K13 




D10 
D11 
D12 
D13 
D14 


A5 
B6 
D7 
A6 
C7 




DR1 
DX1 

EMUO 
EMU1 


N1 
P2 

F14 
E15 




XA10 

XA11 

XA12 

RSVO 

RSV1 


C15 

D14 

E13 

J3 

J4 




XD10 
XD11 
XD12 
XD13 
XD14 


P8 
R8 
R9 
P9 
N9 


A15 
A16 
A17 
A18 
A19 


L14 
M15 
K12 
L13 
M14 




D15 
D16 
D17 
D18 
D19 


B7 
A7 
A8 
B8 
A9 




EMU2 

EMU3 

EMU4/SHZ 

EMU5 

EMU6 


F13 

E14 

F12 

C1 

M6 




RSV2 
RSV3 
RSV4 
RSV5 
RSV6 


K1 
K2 

L1 
K3 
L2 




XD15 
XD16 
XD17 
XD18 
XD19 


R10 

M9 

P10 

R11 

N10 


A20 
A21 
A22 
A23 
LOCATOR 


N15 
M13 
L12 
N14 
E5 




D20 
D21 
D22 
D23 
D24 


B9 

C9 

A10 

D9 

B10 




H1 
H3 

X1 
X2/CLKIN 


B3 
A1 

C2 

B1 




RSV7 
RSV8 
RSV9 
RSV10 

advdd 


K4 
M1 
L3 
M2 
D12 




XD20 
XD21 
XD22 
XD23 
XD24 


P11 

R12 

M10 

N11 

P12 


IACK 
INTO 
INT1 
INT2 
INT3 


G1 

H2 

H1 

J1 

J2 




D25 
D26 
D27 
D28 
D29 


A11 
C10 
B11 
A12 
D10 




TCLKO 
TCLK1 

XFO 
XF1 


P4 
N5 

G2 
G3 




advdd 

DDVdd 
DDV DD 

iodv dd 
iodvdd 


H11 

D4 

E8 

L8 

M12 




XD25 
XD26 
XD27 
XD28 
XD29 


R13 
R14 
M11 
N12 
P13 


MC/MP 

MSTRB 

RDY 

RESET 

R/W 


D15 

E3 

E1 

F1 

G4 




D30 
D31 


C11 
B12 

F3 
E2 




VBBP 

V SUBS 

V D D 

v D d 
v D d 


D3 
E4 
H4 
D8 
M8 




MDVdd 
PDVdd 

cvss 
cvss 
vss 


H5 

M4 

B2 

P14 

C8 




XD30 

XD31 

DV SS 

DV SS 

DV SS 


R15 

P15 

C3 

C13 

N3 


HOLD 
HOLDA 


STRB 
IOSTRB 


F2 
F4 




XRDY 
XR/W 


D2 
D1 




v D d 

VSS 


H12 
N8 




vss 
vss 


H3 
H13 




DV SS 

ivss 


N13 
B14 



Notes: 1) ADVdd. DDVdd. lODVpD- MDV DD , and PDVdd pins (D4, D12, E8, H5, H11, L8, M4, and M12) 
are on a common plane internal to the device. 

2 ) Vdd P' ns (D 8 > H4, H12, and M8) are on a common plane internal to the device. 

3) Vss- CVss- and IVsspins (B2, B14, C8, H3, H13, N8, and P1 4) are on a common plane internal 
to the device. 

4) DVss pins (C3, C13, N3, and N13) are on a common plane internal to the device. 
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Pinout and Pin Assignments 



Table 13-2. TMS320C30 Pin Assignments(Alphabetical) (Figure 13-1 and Figure 13-2) 



Notes: 



Signal 


Pin 




Signal 


Pin 




Signal 


Pin 




Signal 


Pin 




Signal 


Pin 


AO 
A1 
A2 
A3 
A4 


F15 
G12 
G13 
G14 
G15 




D8 

D9 

D10 

D11 

D12 


B5 
C6 
A5 
B6 
D7 




EMU5 

EMU6 

FSRO 

FSR1 

FSXO 


C1 

M6 
P3 
M3 
R2 




TCLK1 
V BBP 
V D D 

v D d 
v D d 


N5 
D3 
H4 
D8 
M8 




XD14 
XD15 
XD16 
XD17 
XD18 


N9 

R10 

M9 

P10 

R11 


A5 
A6 
A7 
A8 
A9 


H15 
H14 
J15 
J14 
J13 




D13 
D14 
D15 
D16 
D17 


A6 
C7 
B7 
A7 
A8 




FSX1 

H1 

H3 

HOLD 

HOLDA 


P1 
B3 
A1 
F3 
E2 




vdd 
vss 
vss 
vss 
vss 


H12 

N8 

C8 

H3 

H13 




XD19 
XD20 
XD21 
XD22 
XD23 


N10 

P11 

R12 

M10 

N11 


A10 
A11 
A12 
A13 
A14 


K15 
J12 
K14 
L15 
K13 




D18 
D19 
D20 
D21 
D22 


B8 
A9 
B9 
C9 
A10 




IACK 
INTO 
INT1 
INT2 
INT3 


G1 

H2 

H1 

J1 

J2 




VSUBS 

X1 

X2/CLKIN 

XAO 

XA1 


E4 

C2 

B1 

A13 

A14 




XD24 
XD25 
XD26 
XD27 
XD28 


P12 
R13 
R14 
M11 
N12 


A15 
A16 
A17 
A18 
A19 


L14 
M15 
K12 
L13 
M14 




D23 
D24 
D25 
D26 
D27 


D9 

B10 

A11 

C10 

B11 




IODVdd 

iodvdd 
ivss 

lOSTRB 
LOCATOR 


M12 

L8 

B14 

F4 

E5 




XA2 
XA3 
XA4 
XA5 
XA6 


D11 
C12 
B13 
A15 
B15 




XD29 

XD30 

XD31 

XFO 

XF1 


P13 

R15 

P15 

G2 

G3 


A20 
A21 
A22 
A23 
ADVdd 


N15 
M13 
L12 
N14 
■H11 




D28 
D29 
D30 
D31 


A12 
D10 
C11 
B12 




MC/MP 

MDV DD 
MSTRB 

PDVdd 
R/W 


D15 

H5 

E3 

M4 

G4 




XA7 

XA8 

XA9 

XA10 

XA11 


C14 
E12 
D13 
C15 
D14 




XRDY 
XR/W 


D2 
D1 


advdd 

CLKRO 
CLKR1 
CLKXO 
CLKX1 


D12 

N4 

L4 

M5 

N2 




DDVdd 

DDVdd 

DRO 

DR1 

DV SS 


E8 
D4 
R1 
N1 
N13 




RDY 

RESET 

RSVO 

RSV1 

RSV2 


E1 
F1 
J3 
J4 
K1 




XA12 

XDO 

XD1 

XD2 

XD3 


E13 

R4 

P5 

N6 

R5 








cv ss 
cvss 

DO 
D1 
D2 


B2 

P14 

C4 

D5 

A2 




DV SS 

DV SS - 

DV SS 

DXO 

DX1 


N3 

C3 

C13 

R3 

P2 




RSV3 
RSV4 
RSV5 
RSV6 
RSV7 


K2 
L1 
K3 
L2 
K4 




XD4 
XD5 
XD6 
XD7 
XD8 


P6 
M7 
R6 
N7 
P7 








D3 
D4 
D5 
D6 
D7 


A3 
B4 
C5 
D6 
A4 




EMUO 
EMU1 
EMU2 
EMU3 
EMU4/SH2 


F14 
E15 
F13 
E14 
F12 




RSV8 

RSV9 

RSV10 

STRB 

TCLKO 


M1 
L3 
M2 
F2 
P4 




XD9 

XD10 

XD11 

XD12 

XD13 


R7 
P8 
R8 
R9 
P9 









1) ADVdd. DDVdd- IODVp D , MDV dd , and PDV DD pins (D4, D12, E8, H5, H11, L8, M4, and M12) 
are on a common plane internal to the device. 

2 ) Vdd P' ns (D8, H4, H12, and M8) are on a common plane internal to the device. 

3) Vss.CVss. and IV$s pins (B2, B14, C8, H3, H1 3, N8, and P1 4) are on a common plane internal 
to the device. 

4 ) DVss P^s (C3, C13, N3, and N13) are on a common plane internal to the device. 
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13.1.2 TMS320C31 Pinouts and Pin Assignments 



The TMS320C31 device is packaged in a 132-pin plastic quad flat pack 
(PQFP) JDEC standard package. Figure 1 3-3 shows the pinouts for this pack- 
age, Figure 1 3-5 on page 1 3-1 6 shows the mechanical layout, and Table 1 3-3 
shows the associated pin assignments alphabetically; Table 1 3-4 shows the 
associated pin assignments numerically. 



Figure 13-3. TMS320C31 Pinout (Top View) 
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Table 13-3. TMS320C31 Pin Assignments (Alphabetical) (Figure 13-3) 



Signal 


Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


Pin 


AO 
A1 
A2 
A3 
A4 


29 
28 
27 
26 
25 


D4 
D5 
D6 
D7 
D8 


76 
75 
73 
72 
68 


EMUO 
EMU1 
EMU2 
EMU3 
FSRO 


124 
125 
126 
123 
110 


Vdd 
vdd 
vdd 
vdd 
vdd 


40 
49 
59 
65 
66 


vss 
vss 
vss 
vss 
vss 


84 

85 

86 

101 

102 


A5 
A6 
A7 
A8 
A9 


23 
22 
21 
20 
18 


D9 

D10 

D11 

D12 

D13 


67 
64 
63 
62 
60 


FSXO 

HOLD 

HOLDA 

H1 

H3 


114 

90 

89 

81 

82 


vdd 
vdd 
vdd 
vdd 
vdd 


74 
83 
91 
97 

104 


vss 
vss 
vss 
vss 
vss 


109 
113 
117 
119 
128 


A10 
A11 
A12 
A13 
A14 


16 
14 
13 
12 
11 


D14 
D15 
D16 
D17 
D18 


58 
56 
55 
54 
53 


IACK 

INTO 

INT1 

INT2 

INT3 


99 

100 

103 

106 

107 


vdd 
vdd 
vdd 
vdd 
vdd 


105 
115 
121 
131 
132 


X1 

X2/CLKIN 
XFO 
XF1 


88 
87 
96 
98 


A15 
A16 
A17 
A18 
A19 


10 

9 

8 

7 
5 


D19 
D20 
D21 
D22 
D23 


52 
50 
48 
47 
46 


MCBUMP 

R/W 

RDY 

RESET 

SHZ 


127 

94 

92 

95 

118 


vss 
vss 
vss 
vss 
vss 


3 

4 

17 

19 

30 






A20 
A21 
A22 
A23 
CLKRO 


2 

1 

130 

129 

111 


D24 
D25 
D26 
D27 
D28 


45 
44 
43 
41 
39 


STRB 

TCLKO 

TCLK1 


93 

120 

122 


vss 
vss 
vss 
vss 
vss 


35 
36 
37 
42 
51 






CLKXO 

DO 

D1 

D2 

D3 


112 

80 

79 

78 

77 


D29 
D30 
D31 
DRO 
DXO 


38 

34 

31 

108 

116 


V D D 

Vdd 
v D d 
vdd 
Vdd 


6 

15 

24 

32 

33 


vss 
vss 
vss 
vss 
vss 


57 
61 
69 
70 
71 
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Table 13-4. TMS320C31 Pin Assignments (Numerical) (Figure 13-3) 



Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


Pin 


Signal 


1 
2 
3 
4 
5 


A21 
A20 

vss 
vss 

A19 


31 
32 
33 
34 
35 


D31 

vdd 
vdd 

D30 

vss 


61 
62 
63 
64 
65 


v S s 

D12 
D11 
D10 

v D d 


91 
92 
93 
94 
95 


Vnn 

RDY 

STRB 

R/W 

RESET 


121 
122 
123 
124 
125 


VDD 

TCLK1 

EMU3 

EMUO 

EMU1 


6 
7 
8 
9 
10 


vdd 

A18 
A17 
A16 
A15 


36 
37 
38 
39 
40 


vss 
vss 

D29 
D28 
VDD 


66 
67 
68 
69 
70 


vdd 

D9 
D8 

vss 
vss 


96 
97 
98 
99 
100 


XFO 

VDD 
XF1 
IACK 
INTO 


126 
127 
128 
129 
130 


EMU2 
MCBL/MP 

vss 

A23 
A22 


11 
12 
13 
14 
15 


A14 
A13 
A12 
A11 

vdd 


41 
42 
43 
44 
45 


D27 

vss 

D26 
D25 
D24 


71 
72 
73 
74 
75 


vss 

D7 
D6 

Vdd 

D5 


101 
102 
103 
104 
105 


vss 
Vss 

INT1 

vdd 
vdd 


131 
132 


vdd 
vdd 


16 
17 
18 
19 
20 


A10 

vss 

A9 

vss 

A8 


46 
47 
48 
49 
50 


D23 
D22 
D21 

VDD 
D20 


76 
77 
78 
79 
80 


D4 
D3 
D2 
D1 
DO 


106 
107 
108 
109 
110 


INT2 
INT3 
DRO 

vss 

FSRO 






21 
22 
23 
24 
25 


A7 
A6 
A5 

vdd 

A4 


51 
52 
53 
54 
55 


vss 

D19 
D18 
D17 
D16 


81 
82 
83 
84 
85 


H1 
H3 
VDD 

Vss 
vss 


111 
112 
113 ' 
114 
115 


CLKRO 
CLKXO 

vss 

FSXO 

vdd 






26 
27 
28 
29 
30 


A3 
A2 
A1 
A0 

vss 


56 
57 
58 
59 
60 


D15 

vss 

D14 

vdd 

D13 


86 
87 
88 
89 
90 


vss 

X2/CLKIN 
X1 

HOLDA 
HOLD 


116 
117 
118 
119 
120 


DXO 

Vss 

SHZ 

vss 

TCLKO 
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Signal Descriptions 



13.2 Signal Descriptions 

13.2.1 TMS320C30 Signal Descriptions 



Table 1 3-5 describes the signals that the TMS320C30 device uses in the mi- 
croprocessor mode. They are listed according to the signal name; the number 
of pins allocated; the input (I), output (O), or high-impedance state (Z) operat- 
ing modes; a brief description of the signal's function; and the condition that 
places a n output pin in high impedance. A line over a signal name (for example, 
RESET) indicates that the signal is active low (true at a logic level). Pins la- 
beled NC are not to be connected by the user. The signals are grouped accord- 
ing to function. 



Table 13-5. TMS320C30 Signal Descriptions 



Signal 


#Pins 


l/O/Zt 


Description 


Condition When 
Signal Is in High Z* 


Primary Bus Interface (61 Pins) 


D31-D0 


32 


l/O/Z 


32-bit data port of the primary bus interface. 


S H R 


A23-A0 


24 


o/z 


24-bit address port of the primary bus interface. 


S H R 


R/W 


1 


o/z 


Read/write signal for primary bus interface. This pin is 
high when a read is performed and low when a write is 
performed over the parallel interface. 


S H R 


STRB 


1 


o/z 


External access strobe for the primary bus interface. 


S H 


RDY 


1 


I 


Ready signal. This pin indicates that the external device 
is prepared for a primary bus interface transaction to 
complete. 


S 


HOLD 


■ 1 


I 


Hold signal for primary bus interface. When HOLD is a 
logic low, any ongoing transaction is completed. The 




A23-A0, D31-D0, STRB, and R/W signals are placed 
in a high-impedance state, and all transactions over the 


primary bus interface are held until HOLD becomes a 
logic high or the NOHOLD bit of the primary bus control 
register is set. 


HOLDA 


1 


o/z 


Hold acknowledge signal for primary bus interface. This 
signal is generated in response to a loqic low on HOLD, 
it signals that A23-A0, D31-D0, STRB, and R/W are 
placed in a high-impedance state and that all transac- 
tions over the bus will be held. HOLDA will be high in re- 
sponse to a logic high of HOLD or when the NOHOLD 
bit of the primary bus control register is set. 


S 


Expansion Bus Interface (49 Pins) 


XD31-XD0 


32 


l/O/Z 


32-bit data port of the expansion bus interface. 


S R 


XA12-XA0 


13 


O/Z 


13-bit address port of the expansion bus interface. 


S R 


XR/W 


1 


o/z 


Read/write signal for expansion bus interface. When a 
read is performed, this pin is held high; when a write is 
performed, this pin is low. 


S R 


MSTRB 


1 


o/z 


External memory access strobe for the expansion 
bus interface. 


S 



t Input (I), output (0), high-impedance state (Z). 
T- S = SHZ active, H = Hold active, R = Reset active. 
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Table 13-5. TMS320C30 Signal Descriptions (Continued) 



Signal 


#Pins 


l/O/Zt 


Description 


Condition When 
Signal Is in High Z* 


Expansion Bus Interface (49 Pins) (Concluded) 


IOSTRB 


1 


o/z 


External I/O access strobe for the expansion bus inter- 
face. 


S 


XRDY 


1 


I 


Ready signal. This pin indicates that the external device 
is prepared for an expansion bus interface transaction 
to complete. 




Control Signals (9 Pins) 


RESET 


1 


I 


Reset. When this pin is a logic low, the device is placed 
in the reset condition. After reset becomes a logic high, 
execution begins from the location specified by the re- 
set vector. 




INT3-1NT0 


4 


I 


External interrupts. 




IACK 


1 


o/z 


Interrupt acknowledge signal. IACK is set to by the 
IACK instruction. This can be used to indicate the begin- 
ning or end of an interrupt service routine. 


S 


MC/MP 


1 


I 


Microcomputer/microprocessor mode pin. 




XF1.XF0 


2 


l/O/Z 


External flag pins. They are used as general-purpose 
I/O pins or to support interlocked processor instruc- 
tions. 


S R 


Serial Port Signals (6 Pins) 


CLKXO 


1 


l/O/Z 


Serial port transmit clock. This pin serves as the serial 
shift clock for the serial port transmitter. 


S R 


DXO 


1 


l/O/Z 


Data transmit output. Serial port transmits serial data 
on this pin. 


S R 


FSXO 


1 


l/O/Z 


Frame synchronization pulse for transmit. The FSXO 
pulse initiates the transmit data process over pin DXO. 


S R 


CLKRO 


1 


l/O/Z 


Serial port receive clock. This pin serves as the serial 
shift clock for the serial port receiver. 


S R 


DRO, 


1 


l/O/Z 


Data receive. Serial port receives serial data via the 
DRO pin. 


S R 


FSRO 


1 


l/O/Z 


Frame synchronization pulse for receive. The FSRO 
pulse initiates the receive data process over DRO. 


S R 


Serial Port 1 Signals (6 Pins) 


CLKX1 


1 


l/O/Z 


Serial port 1 transmit clock. This pin serves as the serial 
shift clock for the serial port 1 transmitter. 


S R 


DX1 


1 


l/O/Z 


Data transmit output. Serial port 1 transmits serial data 
on this pin. 


S R 


FSX1 


1 


l/O/Z 


Frame synchronization pulse for transmit. The FSX1 
pulse initiates the transmit data process over pin DX1 . 


S R 


CLKR1 


1 


l/O/Z 


Serial port 1 receive clock. This pin serves as the serial 
shift clock for the serial port 1 receiver. 


S R 



t Input (I), output 
$ S = SHZ active, 



(O), high-impedance state (Z). 
H = Hold active, R = Reset active. 
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Table 13-5. TMS320C30 Signal Descriptions (Continued) 



Signals 


#Pins 


l/O/Zt 


Description 


Condition When 
Signal Is in High Z* 


Serial Port 1 Signals (6 Pins) (Concluded) 


DR1 


1 


l/O/Z 


Data receive. Serial port 1 receives serial data via the 
DR1 pin. 


S R 


FSR1 


1 


l/O/Z 


Frame synchronization pulse for receive. The FSR1 
pulse initiates the receive data process over DR1 . 


S R 


Timer Signals (1 Pin) 


TCLKO 


1 


l/O/Z 


Timer clock. As an input, TCLKO is used by timer to 
count external pulses. As an output pin, TCLKO outputs 
pulses generated by timer 0. 


S R 


Timer 1 Signals (1 Pin) 


TCLK1 


1 


l/O/Z 


Timer clock. As an input, TCLK1 is used by timer 1 to 
count external pulses. As an output pin, TCLK1 outputs 
pulses generated by timer 1 . 


S R 


Supply and Oscillator Signals (29 Pins) 


VDD3 — VDDO 


4 




Four +5-V supply pins. § 




iodvddi.iodvddo 


2 




Two +5-V supply pins. § 




advdd-i.advddo 


2 




Two +5-V supply pins. § 




PDV DD 


1 




One +5-V supply pin. § 




DDV DD1 ,DDV DD0 


2 




Two +5-V supply pins. § 




MDV DD 


1 




One +5-V supply pin. § 




V SS3 — v SS0 


4 




Four ground pins. 




DVSS3— DVsso 


4 




Four ground pins. 




cvssi.cvsso 


2 




Two ground pins. 




"Vss 


1 




One ground pin. 




V BBP 


1 


NC 


Vbb pump oscillator output. 




VSUBS 


1 


I 


Substrate pin. Tie to ground. 




X1 


1 


o/z 


Output pin from the internal oscillator for the crystal. If 
a crystal is not used, this pin should be left uncon- 
nected. 


S 


X2/CLKIN 


1 


I 


Input pin to the internal oscillator from the crystal or a 
clock. 




H1 


1 


o/z 


External H1 clock. This clock has a period equal to twice 
CLKIN. 


S 


H3 


1 


o/z 


External H3 clock. This clock has a period equal to twice 
CLKIN. 


S 


Reserved (18 Pins) fl 


EMU2 — EMUO 


3 


I 


Reserved. Use pull-ups to +5 volts. See Section 12.6. 




EMU3 


1 


o/z 


Reserved. See Section 12.6. 


S 



t Input (I), output (O), high-impedance state (Z). 
$ S = SHZ active, H = Hold active, R = Reset active. 
§ Recommended decoupling capacitor is 0.1 |iF. 

11 Follow the connections specified for the reserved pins. 1 8- to 22-kQ pull-up resistors are recommended. All +5 volt supply 
pins must be connected to a common supply plane, and all ground pins must be connected to a common ground plane. 
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Table 13-5. TMS320C30 Signal Descriptions (Concluded) 



Signals 


#Pins 


l/O/Zt 


Description 


Condition When 
Signal Is in High Z* 


Reserved (18 Pins) 11 (Concluded) 


EMU4/SHZ 


1 


I 


Shutdown high impedance. An active low shuts down 
the TMS320C30 and places all pins in a high-impe- 
dance state. This signal is used for board-level testing 
to ensure that no dual drive conditions occur. CAU- 
TION: An active low on the SHZ pin corrupts 
TMS320C30 memory and register contents. Reset the 
device with an SHZ=1 to restore it to a known operating 
condition. 




EMU6, EMU5 


2 


NC 


Reserved. 




RSV10 — RSVO 


11 


I 


Reserved. Use pull-ups on each pin to +5 volts. 




Locator (1 Pin) 


Locator 


1 


NC 


Reserved. See Figure 13-1 and Table 13-1. 





t Input (I), output (O), high-impedance state (Z). 
+■ S = SHZ active, H = Hold active, R = Reset active. 

K Follow the connections specified for the reserved pins. 18- to 22-kQ pull-up resistors are recommended. All +5 volt supply 
pins must be connected to a common supply plane, and all ground pins must be connected to a common ground plane. 

13.2.2 TMS320C31 Signal Descriptions 

Table 1 3-6 describes the signals that the TMS320C31 device uses in the mi- 
croprocessor mode. They are listed according to the signal name; the number 
of pins allocated; the input (I), output (O), or high-impedance state (Z) operat- 
ging modes; a brief description of the signal's function; and the condition that 
places a n output pin in high impedance. A line over a signal name (for example, 
RESET) indicates that the signal is active low (true at a logic level). 

Table 13-6. TMS320C31 Signal Descriptions 



Signal 


#Pins 


l/O/Zt 


Description 


Condition When 
Signal Is in High Z* 


Primary Bus Interface (61 Pins) 


D31-D0 


32 


l/O/Z 


32-bit data port. 


S H R 


A23-A0 


24 


O/Z 


24-bit address port.. 


S H R 


R/W 


1 


O/Z 


Read/write signal. This pin is high when a read is per- 
formed; low when a write is performed over the parallel 
interface. 


S H R 


STRB 


1 


O/Z 


External access strobe. 


S H 


RDY 


1 


I 


Ready signal. This pin indicates that the external device 
is prepared for a transaction completion. 




HOLD 


1 


I 


Hold signal. When HOLD is a logic low, any ongoing 
transaction is completed. The A23-A0, D31-D0, 
STRB, and RAA/ signals are placed in a high-impedance 
state, and all transactions over the primary bus inter- 
face are held until HOLD becomes a logic high, or the 
NOHOLD bit of the primary bus control register is set. 





t Input (I), output (O), high-impedance (Z)state. 
$ S = SHZ active, H = Hold active, R = Reset active. 
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Table 13-6. TMS320C3 1 Signal Descriptions (Continued) 



Signal 


#Pins 


l/O/Zt 


Description 


Condition When 
Signal Is in High Z* 


Primary Bus Interface (61 Pins) (Concluded) 


HOLDA 


1 


o/z 


Hold acknowledge signal. This signal is generated in re- 
sponse to a logic low on HOLD. It signals that A23-A0, 
D31-D0, STRB, and R/W are placed in a high-impe- 
dance state and that all transactions over the bus will be 
held. HOLDA will be high in response to a logic high of 
HOLD, or the NOHOLD bit of the primary bus control 
register is set. 


S 


Control Signals (10 Pins) 


RESET 


1 


I 


Reset. When this pin is a logic low, the device is placed 
in the reset condition. When reset becomes a logic 1 , 
execution begins from the location specified by the re- 
set vector. 




INT3 — INTO 


4 


I 


. External interrupts. 




IACK 


1 


o/z 


Interrupt acknowledge signal. IACK is set to 1 by the 
IACK instruction. This can be used to indicate the begin- 
ning or end of an interrupt service routine. 


S 


MCBL/MP 


1 


I 


Microcomputer boot loader/microprocessor mode pin. 




SHZ 


1 


I 


Shut down high Z. An active low shuts down the 
TMS320C31 and places all pins in a high-impedance 
state. This signal is used for board-level testing to en- 
sure that no dual drive conditions occur. CAUTION : An 
active low on the SHZ pin corrupts TMS320C31 
memory and register contents. Reset the device with an 
SHZ = 1 to restore it to a known operating condition. 




XF1.XF0 


2 


l/O/Z 


External flag pins. They are used as general-purpose 
I/O pins or to support interlocked processor instruc- 
tions. 


S R 


Serial Port Signals (6 Pins) 


CLKRO 




l/O/Z 


Serial port receive clock. This pin serves as the serial 
shift clock for the serial port receiver. 


S R 


CLKXO 




l/O/Z 


Serial port transmit clock. This pin serves as the serial 
shift clock for the serial port transmitter. 


S R 


DRO 




l/O/Z 


Data receive. Serial port receives serial data via the 
DRO pin. 


S R 


DXO 




l/O/Z 


Data transmit output. Serial port transmits serial data 
on this pin. 


S R 


FSRO 




l/O/Z 


Frame sychronization pulse for receive. The FSRO 
pulse initiates the receive data process over DRO. 


S R 


FSXO 




l/O/Z 


Frame synchronization pulse for transmit. The FSXO 
pulse initiates the transmit data process over pin DXO. 


S R 



t Input (I), output (O), high-impedance state (Z). 
$ S = SHZ active, H = Hold active, R = Reset active. 
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Table 13-6. TMS320C31 Signal Descriptions (Concluded) 






Signal 


#Pins 


l/O/Zt 


Description 


Condition When 
Signal Is in High Z* 


Timer Signals (2 Pins) 


TCLKO 


1 


l/O/Z 


Timer clock 0. As an input, TCLKO is used by timer to 
count external pulses. As an output pin, TCLKO outputs 
pulses generated by timer 0. 


S 


TCLK1 


1 


l/O/Z 


Timer clock 1 . As an input, TCLKO is used by timer 1 to 
count external pulses. As an output pin, TCLK1 outputs 
pulses generated by timer 1 . 


s 


Supply and Oscillator Signals (49 Pins) 


H1 


1 


o/z 


External H 1 clock. This clock has a period equal to twice 
CLKIN. 


s 


H3 


1 


o/z 


External H3 clock. This clock has a period equal to twice 
CLKIN. 


s 




VDD 


20 


I 


+5-Vdc supply pins. All pins must be connected to a 
common supply plane. § 




vss 


25 


I 


Ground pins. All ground pins must be connected to a 
common ground plane. 




X1 


1 


o/z 


Output pin from the internal crystal oscillator. If a crystal 
is not used, this pin should be left unconnected. 


s 


X2/CLKIN 


1 


I 


The internal oscillator input pin from a crystal or a clock. 




Reserved (4 Pins) 11 


EMU2 — EMUO 


3 


I 


Reserved. Use 20-kQ pull-up resistors to +5 volts. 




EMU3 


1 


o/z 


Reserved. 


s 



t Input (I), output (O), high-impedance state (Z). 
$ S = SHZ active, H = Hold active, R = Reset active. 
§ Recommended decoupling capacitor value is 0.1 \\F. 

H Follow the connections specified for the reserved pins. 1 8- to 22-kQ pull-up resistors are recommended. All +5 volt supply 
pins must be connected to a common supply plane, and all ground pins must be connected to a common ground plane. 
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13.3 TMS320C3x Mechanical Data 

Figure 13-4. TMS320C30 181-Pin PGA Dimensions 



PinA1 

Corner 

Indicator 




40,4 (1.590) 
37, 6 (1 .480) 



40,4 (1.590) 



37,6 (1.480) 



Thermal Resistance Characteristics 



Parameter 


°C/W 


Air Flow 
LFPM 


R0JC 


3.8 


N/A 


R 0JA 


21.6 





R0JA 


13.3 


200 


R0JA 


9.7 


400 


R0JA. 


8.3 


600 


R0JA 


7.4 


800 


R0JA 


7.1 


1000 



4,95 (0.195) _ 
2,54 (0.100) 1 



i_t 



4,06 (0.160) 



2,54 (0.100) 



0,508 (0.020) 
0,406 (0.016) 
(181 Places) 



1,40 (0.055) 
r 1,14 (0.045) 



jy 7 



,27 (0.050) Norn 
Dia (4 Places) 



2,54(0.100) 



T.P.-WW- 

— R|S®® 



35,6(1.40) 
REF 



®®®®®®®®®® ®(© 
P ®®®®®®®®®®®®®®® 
N ®®®®®®®®®®®®®®® 
M ®®®®®®®®®®®®®®® 
L ®®®® ® ®®®® 
K ®®®® Bottom ®®®® 
J ®®®® View ®®®® 
H ®®®®® ®®®®® 
G ® ® ® ® r- Locator ® ® ® ® 
F ®®®® * ®®®® 

E ®®®®® ® ®®®® 
D ®®®®®®®®®®®®®®® 
C ®®®®®®®®®®®®®®® 
B ®®®®®®®®®®®®®®®- 

A ^!®®®®®®®®®®®®®s 



i" 



2,54 (0.100) TYP 



Index Corner- 



1 23456789 101112131415 



1 



ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 
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Figure 13-5. TMS320C3 1 132-Pin Plastic Quad Flat Pack 



0,254 (0.01 0) Norn 
0,635 (0.025) Norn -frl k- 



4,45 (0.175) 
4,19 (0.165) 



W — N 



0,76 (0.030) Nom 
i 



24,18(0.952) r= 
24,08 (0.948) = 



27,56(1.085) 
27,31 (1 .075) 



24,18(0.952) 
24,08 (0.948) 

27,56(1.085) 
27,31 (1 .075) 



Thermal Resistance Characteristics 



Parameter 


°c/w 


Air Flow 
LFPM 


R 0JC 


11.0 


N/A 


R0JA 


55.1 





R©JA 


TBD 


200 


R0JA 


TBD 


400 


R 0JA 


TBD 


600 


R©JA 


TBD 


800 


R0JA 


TBD 


1000 



ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 
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13.4 Electrical Specifications 

Table 13-7. Absolute Maximum Ratings o ver Specified Temperature Range 



Condition/Characteristic 


TMS320C30/TMS320C31 Range 


Supply voltage range, Vqd 


- 0.3 V to 7 V 


Input voltage range 


- 0.3 V to 7 V 


Output voltage range 


-0.3 V to 7 V 


Continuous power dissipation 


3.15 W for TMS320C30 

1 .7 W for TMS320C31 

(See Note 3) 


Operating case temperature range 


0°Cto85°C 


Storage temperature range 


-55°Cto150°C 



Notes: 1 ) Stresses beyond those listed under "Absolute Maximum Ratings" may cause permanent damage to the device. This 
is a stress rating only; functional operation of the device at these or any other conditions beyond those indicated 
in the "Recommended Operating Conditions" section of this specification is not implied. Exposure to absolute-maxi- 
mum-rated conditions for extended periods may affect device reliability. 

2) All voltage values are with respect to Vss- 

3) Actual operating power will be less. This value was obtained under specially produced worst-case test conditions, 
which are not sustained during normal device operation. These conditions consist of continuous parallel writes of 
a checkerboard pattern to both primary and expansion buses at the maximum rate possible. See nominal Odd) cur ~ 
rent specification in Table 1 3-8 and also read Calculation of TMS320C30 Power Dissipation, Appendix E. 



Table 13-8. Recommended Operating Conditions 








Parameter 


Min 


Nom Max 


Unit 


Vqd Supply voltages (DDVdd. etc -) 


4.75 


5 5.25 


V 


Vss Supply voltages (CVss. © tc -) 





V 


V|h High-level input voltage 


2 


VDD + 
0.3t 


V 


V|l Low-level input voltage 


-0.3t 


0.8 


V 


lOH High-level output current 


-300 


uA 


lOL Low-level output current 


2 


mA 


T Operating case temperature 





85 


°C 


V TH CLKIN high-level input voltage for CLKIN 


2.6 


V D D + 0.3t 


V 



t Guaranteed from characterization but not tested. 

Note: Note 1 for Table 1 3-7 also applies to this table. All input and output voltages except for CLKIN are TTL-compatible. CLKIN 
may be driven by a CMOS clock. 
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Table 13-9. Electrical Characteristics Over Specified Free-Air Temperature Range 



Electrical Characteristic 


Min 


Nom 


Max 


Unit 


VOH 


High-level output voltage 
( V D d = Min, Ioh = Max) 


2.4 


3 




V 


vol 


Low-level output voltage ( Vqd = Min, 
lOL = Max) 




0.3 


0.6 


V 


iz 


Three-state current ( Vqd = Max) 


-20 




20 


HA 


"i 


Input current ( V| = Vss t° V DD) 


-10 




10 


HA 


iip 


Input current ( Inputs with internal pull-ups) 
(See Note 4) 


-400 




20 


uA 


ice 


Supply current ( Ta = 25 ° C.Vqd = Max, 
f x = Max) (See Note 5) 


TMS320C30-33 
TMS320C30-27 
TMS320C30-40 
TMS320C31-33 
TMS320C31-27 




200 
175 
250 
150 
125 


600 
500 
TBD 
325 
250 


mA 


C| 


Input capacitance (except CLKIN) 


15* 


pF 


CLKIN 


25 


Co 


Output capacitance 


20* 


PF 



* Guaranteed by design but not tested. 

Notes: 1 ) All nominal values are at Vqd = 5 V, T/\ = 25°C. ' 

2) f x is the input clock frequency. The maximum value is 40 MHz. 

3) All input and output voltage levels are TTL compatible. 

4) Pins with internal pull-up devices: INT3 — INTO, MC/MP, RSV10 — RSV0. Although RSV1 0-RSV0 have internal 
pullup devices, external pullups should be used on each pin as described in Table 13-5, page 13-12. 

5) Actual operating current will be less than this maximum value. This value was obtained under specially produced 
worst-case test conditions, which are not sustained during normal device operation. These conditions consist of con- 
tinuous parallel writes of a checkerboard pattern to both primary and expansion buses at the maximum rate possible. 
See Calculation of TMS320C30 Power Dissipation, Appendix E. 



Figure 13-6. Test Load Circuit 



Tester Pin 
Electronics 



v Load 




Output 
o Under 
Test 



Where: Iol = 2 - n mA (all outputs) 

lOH = 300 ^A (all outputs) 
VLoad = 2.15 V 
Cj = 80 pF typical load circuit capacitance. 
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13.5 Signal Transition Levels 



13.5.1 TTL-Level Outputs 



TTL-compatible output levels are driven to a minimum logic-high level of 2.4 
volts and to a maximum logic-iow level of 0.6 volt. Figure 1 3-7 shows the TTL- 
level outputs. 



Figure 13-7. TTL-Level Outputs 



TTL-output transition times are specified as follows: 

□ For a high-to-low transition, the level at which the output is said to be no 
longer high is 2.0 volts, and the level at which the output is said to be low 
is 1.0 volt. 

Ql For a low-to-high transition, the level at which the output is said to be no 
longer low is 1 .0 volt, and the level at which the output is said to be high 
is 2.0 volts. 



13.5.2 TTL-Level Inputs 

Figure 13-8. TTL-Level Inputs 



2.0 V 
90% 



10% 
0.8 V 



TTL-compatible input transition times are specified as follows: 

Q For a high-to-low transition on an input signal, the level at which the input 
is said to be no longer high is 2.0 volts, and the level at which the input is 
said to be low is 0.8 volt. 

Q For a low-to-hightransition on an input signal, the level at which the input 
is said to be no longer low is 0.8 volt, and the level at which the input is said 
to be high is 2.0 volts. 

Figure 13-8 shows the TTL-level inputs. 
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13.6 Timing 



Timing specifications apply to the TMS320C30 and TMS320C31 . 



13.6.1 X2/CLKIN, H1, and H3 Timing 



Table 13-10 defines the timing parameters fortheCLKIN, H1 , and H3 interface 
signals. The numbers shown in parentheses in Figure 1 3-9 and Figure 1 3-1 
correspond with those in the No. column of Table 13-10. 



Figure 13-9. Timing for X2/CLKIN 



X2/CLKIN 



Figure 13-10. Timing for H1/H3 
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Table 13-10. Timing Parameters for CLKIN, H1, and H3 (Figure 13-9 and Figure 13-10) 



No. 


Name 


Description 


TMS320C30-27 
TMS320C31-27 


TMS320C30-33 
TMS320C31-33 


TMS320C30-40 


Unit 


Min Max 


Min Max 


Min Max 


(1) 


tffCI) 


CLKIN fall time 


6* 


5* 


5* 


ns 


(2) 


tw(CIL) 


CLKIN low pulse duration 
tc(CI) = min 


14 


10 


9 


ns 


(3) 


MCIH) 


CLKIN high pulse duration 
tc(Ci) = min 


14 


10 


9 


ns 


(4) 


V(CI) 


CLKIN rise time 


6* 


5* 


5* 


ns 


(5) 


ten 


CLKIN cycle time 


37 303 


30 303 


25 303 


ns 


(6) 


tf(H) 


H1/H3 fall time 


4 


3 


3 


ns 


(7) 


tw(HL) 


H1/H3 low pulse duration 


P-6 


P-6 


P-5 


ns 


(S) 


tw(HH) 


H1/H3 high pulse duration 


P-7 


P-7 


P-6 


ns 


0) 


V(H) 


H1/H3 rise time 


5 


4 


3 


ns 


(9.1) 


td(HL-HH) 


Delay from H1 (H3) low to 
H3(H1)high 


Ot 6 


0t 5 


0t 4 


ns 


(10) 


WH) 


H1/H3 cycle time 


74 606 


60 606 


50 606 


ns 



t Guaranteed from characterization but not tested, 
t Guaranteed by design but not tested. 
Note: P = t c (d) 
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13.6.2 Memory Read/Write Timing 



Table 13-11 defines memory read/write timing parameters. The numbers 
shown in parentheses in Figure 13-11 and Figure 13-12 correspond with 
those in the No. column of Table 1 3-1 1 . 



Figure 13-11. Timing for Memory ( (M)STRB = 0) Read 

H3 



V_AA 



H1 



(M)STRB 



H-02) 



op -»j w- ! -*| 



(X)R/W 



X 



(14.1/14.2) 



(X)A 



X 



X 



(13.1/13.2) 



X 



| I — S S— (15.1/15.2) 
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Table 13- 1 1. Timing Parameters for a Memory ( (M)STRB) = 0) Read/Write (Figure 13- 1 1 and Figure 13-12) 



No. 


Name 


Description 


TMS320C30-27 
TMS320C31-27 


TMS320C30-33 
TMS320C31-33 


TMS320C30-40 


Unit 


Min 


Max 


Min 


Max 


Min 


Max 


(11) 


td(H1 L-(M)SL) 


H1 low to (M)STRB low delay 


0* 


13 


0* 


10 


0* 


6 


ns 


(12) 


*d(H1L-(M)SH) 


H1 low to (M)STRB high delay 


0* 


13 


0* 


10 


0* 


6 


ns 


(13.1) 


td(HIH-RWL) 


H1 high to R/W low delay 


0* 


13 


0* 


10 


0* 


9 


ns 


(13.2) 


td(HIH-XRWL) 


H1 high to XR/W low delay 


0* 


19 


0* 


15 


0* 


13 


ns 


(14.1) 


td(H1L-A) 


H1 low to A valid delay 


0+ 


16 


0* 


14 


o* 


10 


ns 


(14.2) 


td(H1 L-XA) 


H1 low to XA valid delay 


0* 


12 


o* 


10 


0* 


9 


ns 


(15.1) 


WD)R 


D setup before H1 low (read) 


18 


16 


14 


ns 


(15.2) 


*su(XD)R 


XD setup before H1 low (read) 


21 


18 


16 


ns 


(16) 


th((X)D)R 


(X)D hold time after H1 low 
(read) 











ns 


(17.1) 


tsu(RDY) 


RDY setup before H1 high 


10 


8 


8 


ns 


(17.2) 


tsu(XRDY) 


XRDY setup before H1 high 


11 


9 


9 


ns 


(18) 


th((X)RDY) 


(X)RDY hold time after H1 high 











ns 



$ Guaranteed by design but not tested. 



Figure 13-12. Timing for Memory ( (M)STRB = 0) Write 
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Table 13-11. Timing Parameters for a Memory ( (M)STRB = 0) Read/Write (Figure 13-11 and Figure 13-12) 
(Concluded) 



No. 


Name 


Description 


TMS320C30-27 
TMS320C31-27 


TMS320C30-33 
TMS320C30-33 


TMS320C30-40 


Unit 
Unit 


Min Max 


Min Max 


Min Max 


(19) 


td(H1H-(X)RWH) 


H1 high to (X)R/W high (write) 
delay 


13 


10 


9 


ns 


(20) 


*v((X)D)W 


(X)D valid after H1 low (write) 


25 


20 


17 


ns 


(21) 


th((X)D)W 


(X)D hold time after H1 high 
(write) 


0* 


0* 


0* 


ns 


(22.1) 


td(H1H-A) 


H1 high to A valid on back-to- 
back write cycles (write) delay 


23 


18 


15 


ns 


(22.2) 


td(H1 H-XA) 


H1 high to XA valid on back-to- 
back write cycles (write) delay 


32 


25 


21 


ns 


(26) 


td(A-(X)RDY) 


(X)RDY delay from A valid delay 


10t 


8t 


7t 


ns 



t Guaranteed from characterization but not tested, 
t Guaranteed by design but not tested. 



13-24 



TMS320C3x Signal Descriptions and Electrical Characteristics 



Timing 



Figure 13- 13. Timing for Memory ( IOSTRB = 0) Read 
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7a6/e /3-/£ 7//77//7^ Parameters for a Memory ( IOSTRB = 0) Read (Figure 13-13 and Figure 13-14) 



No. 


Name 


Description 


TMS320C30-27 


TMS320C30-33 


TMS32 


>0C30-40 


Unit 


Min 


Max 


Min 


Max 


Min 


Max 


(11.1) 


td(HIH-IOSL) 


H1 high to IOSTRB low delay 


0* 


13 


0* 


10 


ot 


9 


ns 


(12.1) 


td(HIH-IOSH) 


H1 high to IOSTRB high 
delay 


0* 


13 


0* 


10 


0* 


9 


ns 


(13.1) 


td(HIL-XRWH) 


H1 low to XR/W high delay 


0* 


13 


0* 


10 


ot 


9 


ns 


(14.3) 


td(H1 L-XA) 


H1 low to XA valid delay 


0* 


13 


0* 


10 


0* 


9 


ns 


(15.3) 


*su(XD)R 


XD setup before H1 high 


19 


15 


13 


ns 


(16.1) 


*h(XD)R 


XD hold time after H1 high 











ns 


(17.3) 


tsu(XRDY) 


XRDY setup before H1 high 


11 


9 


9 


ns 


(18.1) 


th(XRDY) 


XRDY hold time after H1 
high 











ns 



+- Guaranteed by design but not tested. 
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Figure 13-14. Timing for Memory ( IOSTRB - 0) Write 
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Table 13-13. Timing Parameters for a Memory ( IOSTRB = 0) Write (Figure 13-14) 



No. 


Name 


Description 


TMS320C30-27 


TMS320C30-33 


TMS320C30-40 


Unit 


Min Max 


Min Max 


Min Max 


(23) 


td(HIL-XRWL) 


H1 low to XR/W low delay 


Ot- 19 


0* 15 


0* 13 


ns 


(24) 


MXD)W 


XD valid after H1 high 


38 


30 


" 25 


ns 


(25) 


th(XD)W 


XD hold time after H1 low 











ns 



+ Guaranteed by design but not tested. 
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13.6.3 XFO and XF1 Timing When Executing LDFI or LDII 



Table 1 3-1 4 defines timing parameters for XFO and XF1 when you execute 
LDFI or LDII. The numbers shown in parentheses in Figure 1 3-1 5 correspond 
with those in the No. column of Table 13-14. 



Figure 13-15. Timing for XFO andXFI When Executing LDFI or LDII 
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Table 13-14. Timing Parameters for XFO and XF1 When Executing LDFI or LDII (Figure 13-15) 



No. 


Name 


Description 


TMS320C30-27 
TMS320C31-27 


TMS320C30-33 
TMS320C31-33 


TMS320C30-40 


Unit 


Min Max 


Min Max 


Min Max 


(1) 


td(H3H-XF0L) 


H3 high to XFO low delay 


19 


15 


13 


ns 


(2) 


tsu(XF1) 


XF1 setup before H1 low 


13 


10 


9 


ns 


(3) 


thfXF-H 


XF1 hold time after H1 low 











ns 
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13.6.4 XFO Timing When Executing STFI and STII 



Table 1 3-1 5 defines the timing parameters for the XFO and XF1 when you ex- 
ecute STFI or STII. The numbers shown in parentheses in Figure 13-1 6 corre- 
spond with those in the No. column of Table 13-15. 



Figure 13-16. Timing for XFO When Executing a STFI or STII 
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Table 13-15. Timing Parameters for XFO When Executing STFI or STII (Figure 13-16) 



No. 


Name 


Description 


TMS320C30-27 
TMS320C31-27 


TMS320C30-33 
TMS320C31-33 


TMS320C30-40 


Unit 


Min Max 


Min Max 


Min Max 


(1) 


td(H3H-XF0H) 


H3 high to XFO high delay 


19 


15 


13 


ns 
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13.6.5 XFO and XF1 Timing When Executing SIGI 



Table 1 3-1 6 defines the timing parameters for the XFO and XF1 when you ex- 
ecute SIGI. The numbers shown in parentheses in Figure 13-17 correspond 
with those in the No. column of Table 13-16. 



Figure 13-17. Timing for XFO andXFI When Executing SIGI 
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Table 13-16. Timing Parameters for XFO andXFI When Executing SIGI (Figure 13-17) 



No. 


Name 


Description 


TMS320C30-27 
TMS320C31-27 


TMS320C30-33 
TMS320C31-33 


TMS320C30-40 


Unit 


Min Max 


Min Max 


Min Max 


(1) 


td(H3H-XF0L) 


H3 high to XFO low delay 


19 


15 


13 


ns 


(2) 


td(H3H-XF0H) 


H3 high to XFO high delay 


19 


15 


13 


ns 


(3) 


tsu(XF1) 


XF1 setup before H1 low 


13 


10 


9 


ns 


' (4) 


thfXF-O 


XF1 hold time after H1 low 











ns 
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13.6.6 Loading When the XF Pin Is Configured as an Output 



Table 13-17 defines the timing parameters for loading the XF register when 
the XF pin is configured as an output. The numbers shown in parentheses in 
Figure 1 3-1 8 correspond with those in the No. column of Table 13-17. 



Figure 13-18. Timing for Loading XF Register When Configured as an Output Pin 
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Table 13-17. Timing Parameters for Loading XF Register When Configured as an Output Pin (Figure 13-18) 



No. 


Name 


Description 


TMS320C30-27 
TMS320C31-27 


TMS320C30-33 
TMS320C31-33 


TMS320C30-40 


Unit 


Min Max 


Min Max 


Min Max 


(1) 


VH3H-XR 


H3 high to XF valid 


19 


15 


13 


ns 
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13.6.7 Changing the XF Pin From an Output to an Input 



Table 13-18 defines the timing parameters for changing the XF pin from an 
output pin to an input pin. The numbers shown in parentheses in Figure 1 3-1 9 
correspond with those in the No. column of Table 13-18. 



Figure 13-19. Timing for Change ofXF From Output to Input Mode 
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Table 13-18. Timing Parameters ofXF Changing From Output to Input Mode (Figure 13-19) 



No. 


Name 


Description 


TMS320C30-27 
TMS320C31-27 


TMS320C30-33 
TMS320C31-33 


TMS320C30-40 


Unit 


Min Max 


Min Max 


Min Max 


(1) 


th(H3H-XF01) 


XF hold after H3 high 


19 


15 


13 


ns 


(2) 


*su(XF) 


XF setup before H1 low 


13 


10 


9 


ns 


(3) 


th(XR 


XF hold after H1 low 











ns 
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13.6.8 Changing the XF Pin From an Input to an Output 



Table 1 3-1 9 defines the timing parameters for changing the XF pin from an in- 
put pin to an output pin. The numbers shown in parentheses in Figure 13-20 
correspond with those in the No. column of Table 13-19., 



Figure 13-20. Timing for Change of XF From Input to Output Mode 
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Table 13-19. Timing Parameters ofXF Changing From Input to Output Mode (Figure 13-20) 



(D 



No. 


Name 


Description 


TMS320C30-27 
TMS320C31-27 


TMS320C30-33 
TMS320C31-33 


TMS320C30-40 


Unit 


Min Max 


Min Max 


Min Max 


(D 


td(H3H-XFIO) 


H3 high to XF switching from input to 
output delay 


25 


20 


17 


ns 
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13.6.9 Reset Timing 



RESET is an asynchronous input that can be asserted at any time during a 
clock cycle. If the specified timings are met, the exact sequence shown in 
Figure 1 3-21 will occur; otherwise, an additional delay of one clock cycle may 
occur. 

The asynchronous reset signals include XFO/1, CLKXO/1, DXO/1, FSXO/1, 
CLKRO/1 , DRO/1 , FSRO/1 , and TCLKO/1 . 



Table 13-20 defines the timing parameters for the RESET signal. The num- 
bers shown in parentheses in Figure 13-21 correspond with those in the No. 
column of Table 1 3-20. 

Resetting the device initializes the primary and expansion bus control registers 
to 7 software wait states and therefore results in slow external accesses until 
these registers are initialized. 



Note also that HOLD is an asynchronous input and can be asserted during re- 
set. 
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Figure 13-21. Timing for RESET 
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Notes: 1) (X)D includes D31— DO and XD31—XD0. _ 

2) (X)A includes A23— AO , XA12— XAO, a nd ( X)R/W. 

3) Control signals include STRB, MSTRB, and IOSTRB. 

4) Asynchr onously reset signals include XFO/1 , CLKXO/1 , DXO/1 , FSXO/1 , CLKRO/1 , DRO/1 , FSRO/1 , and TCLKO/1 . 

5) RESET is an asynchronous input and can be asserted at any point during a clock cycle. If the specified timings are 
met, the exact seguence shown will occur; otherwise, an additional delay of one clock cycle may occur. 

6) Note that the R/W and XR/W outputs are placed in a high-impedance state during reset and can be provided with 
a resistive pull-up, nominally 1 8-22 k£2, if undesirable spurious writes could be caused when these outputs go low. 

7) Reset vector is fetched three times with 7 software wait states each. 
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Table 13-20. Timing Parameters for RESET (Figure 13-21) 



No. 


Name 


Description 


TMS320C30-27 
TMS320C31-27 


TMS320C30-33 
TMS320C31-33 


TMS320C30-40 


Unit 


Min Max 


Min Max 


Min Max 


(1) 


tsu(RESET) 


Setupfor RESET beforeCLKIN 
low 


28 Pt 


10 Pt 


10 Pt 


ns 


(2.1) 


td(CLKINH-HIH) 


CLKIN high to H1 high delay § 


6 20 


6 16 


4 14 


ns 


(2.2) 


kj(CLKINH-HIL) 


CLKIN high to H1 low delay § 


6 20 


6 16 


4 14 


ns 


(3) 


tsu(RESETH-HIL) 


Setup for RESET high 
before H1 low and after 10 H1 
clock cycles 


13 


10 


9 


ns 


(5.1) 


tcKCLKINH-HSL) 


CLKIN high to H3 low delay § 


6 20 


6 16 


4 14 


ns 


(5.2) 


td(CLKINH-H3H) 


CLKIN high to H3 high delay § 


6 20 


6 16 


4 14 


ns 


(8) 


klis(H1H-(X)D) 


H1 high to (X)D disabled (high 
impedance) 


19t 


15t 


13t 


ns 


0) 


tdis(H3H-(X)A) 


H3 high to (X)A disabled (high 
impedance) 


13t 


10t 


9t 


ns 


(10) 


^(HSH-CONTROLH) 


H3 high to control signals high 
delay 


13t 


10t 


9t 


ns 


(11) 


td(HIH-IACKH) 


H1 high to IACK high delay 


13t 


10t 


9t 


ns 


(12) 


tdis(RESETL-ASYNCH) 


RESET low to asynchronously 
reset signals disabled (high im- 
pedance) 


31T 


25t 


2lt 


ns 



t Characterized but not tested. 

§ See Figure 1 3-22 for temperature dependence for the 33 MHz TMS320C30. 

Note: P = t c (d) 

Figure 13-22. CLKIN to H1/H3 as a Function of Temperature 
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13.6.10 SHZ Pin Timing 



Table 13-21 defines the timing parameters for the SHZ pin. The numbers 
shown in parentheses in Figure 13-23 correspond with those in the No. col- 
umn of Table 13-21. 



Figure 13-23. Timing for SHZ Pin 
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Note: Enabling SHZ destroys TMS320C3x register and memory contents. Assert SHZ = 1 and reset the TMS320C3x to restore 
it to a known condition. 



Table 13-21. Timing Parameters for SHZ Pin (Figure 13-23) 



No. 


Name 


Description 


TMS320C30-27 
TMS320C31-27 


TMS320C30-33 
TMS320C31-33 


TMS320C30-40 


Unit 


Min Max 


Min Max 


Min Max 


0) 


tdis(SHZ) 


SHZ low to all 0, I/O pins dis- 
abled (high impedance) 


Ot 2Pt 


Ot 2Pt 


Ot 2Pt 


ns 


(2) 


ten(SHZ) 


SHZ high to all 0, I/O pins en- 
abled (active) 


Ot 2Pt 


Ot 2Pt 


Ot 2Pt 


ns 



t Characterized but not tested. 
Note: P = t C (d) 
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13.6.11 Interrupt Response Timing 



Table 1 3-22 defines the timing parameters for the INT signals. The numbers 
shown in parentheses in Figure 13-24 correspond with those in the No. col- 
umn of Table 13-22. 



Figure 13-24. Timing for INT3-INT0 Response 
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Table 13-22. Timing Parameters for INT3-INT0 (Figure 13-24) 



No. 


Name 


Description 


TMS320C30-27 
TMS320C31-27 


TMS320C30-33 
TMS320C31-33 


TMS320C30-40 


Unit 


Min Max 


Min Max 


Min Max 


(1) 


tsu(INT) 


INT3-INT0 setup before H1 
low 


19 


15 


13 


ns 


(2) 


tw(INT) 


Interrupt pulse duration to 
guarantee only one interrupt 
seen 


P 2Pt 


P 2Pt 


P 2Pt 


ns 



t Characterized but not tested. 

Note: P = t c (H) 

The interrupt (INT) pins are asynchronous inputs that can be asserted at any 
time during a clock cycle. The TMS320C3x interrupts are level sensitive, not 
edge sensitive. Interrupts are detected on the falling edge of H1. Therefore, 
interrupts must be set up and held to the falling edge of H1 for proper detection. 
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For the processor to recognize only one interrupt on a given input, an interrupt 
pulse must be set up and held to: 

Qi a minimum of one H1 falling edge, and 

Q no more than two H1 falling edges. 

The TMS320C3x can accept an interrupt from the same source every two H1 
clock cycles. 

If the specified timings are met, the exact sequence shown in Figure 1 3-24 will 
occur; otherwise, an additional delay of one clock cycle may occur. 
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1 3.6.1 2 Interrupt Acknowledge Timing 



The IACK output is active for the entire duration of the bus cycle. Its activity is 
extended if the bus cycle utilizes wait states. 



Table 1 3-23 defines the timing parameters for the IACK signal. The numbers 
shown in parentheses in Figure 13-25 correspond with those in the No. col- 
umn of Table 13-23. 



Figure 13-25. Timing for IACK 
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Table 13-23. Timing Parameters for IACK (Figure 13-25) 



No. 


Name 


Description 


TMS320C30-27 
TMS320C31-27 


TMS320C30-33 
TMS320C31-33 


TMS320C30-40 


Unit 


Min Max 


Min Max 


Min Max 


(1) 


*d(H1 H-IACKL) 


H1 high to IACK low delay 


13 


10 


9 


ns 


(2) 


td(l-IIH-IACKH) 


H1 high to IACK high delay 


13 


10 


9 


ns 



Note: 



The IACK output is active for the entire duration of the bus cycle and is therefore extended if the bus cycle utilizes wait 
states. 
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13.6.13 Data Rate Timing Modes 



Unless otherwise indicated, the data rate timings shown in Figure 13-26 and 
Figure 13-27 are valid for all serial port modes, including handshake. For a 
functional description of serial port operation, refer to subsection 8.2.1 2 of the 
TMS320C3x User's Guide. 

Table 1 3-24 defines the timing parameters for the serial port timing. The num- 
bers shown in parentheses in Figure 13-26 correspond with those in the No. 
column of Table 13-24. 



Figure 13-26. Timing for Fixed Data Rate Mode 
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Notes: 1) Timing diagrams show operations with CLKXP = CLKRP = FSXP = FSRP = 0. 

2) Timing diagrams depend upon the length of the serial port word, where n = 8, 1 6, 24, or 32 bits, respectively. 
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Figure 13-27. Timing for Variable Data Rate Mode 
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Notes: 



1) Timing diagrams show operation with CLKXP = CLKRP = FSXP = FSRP = 0. 

2) Timing diagrams depend upon the length of the serial port word, where n = 8, 1 6, 24, or 32 bits, respectively. 

3) The timings that are not specified expressly for the variable data rate mode are the same as those that are specified 
for the fixed data rate mode. 
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Table 13-24. Serial Port Timing Parameters (Figure 13-26 and Figure 13-27) 



No. 


Name 






TMS320C30-27 
TMS320C31-27 


Unit 


Description 


Min Max 


(D 


*d(H1-SCK) 


H1 high to internal CLKX/R delay 


19 


ns 


(2) 


tc(SCK) 


CLKX/R cycle time 


CLKX/R ext 


t c(H )x2.6t 


ns 


CLKX/R int 


tc(H)X2 t c(H )X232t 


(3) 


tw(SCK) 


CLKX/R high/low pulse du- 
ration 


CLKX/R ext 


1c(H)+12t 


ns 


CLKX/R int 


[tc(SCK)/2]-15 [t G (SCK)/2]+5 


(4) 


tr(SCK) 


CLKX/R rise time 


10t 


ns 


(5) 


tf(SCK) 


CLKX/R fall time 


10t 


ns 


(6) 


*d(DX) 


CLKX to DX valid delay 


CLKX ext 


44 


ns 


CLKX int 


25 


(7) 


tsu(DR) 


DR setup before 
CLKR low 


CLKR ext 


13 


ns 


CLKR int 


31 


(8) 


th(DR) 


DR hold from 
CLKR low 


CLKR ext 


13 


ns 


CLKR int 





(9) 


td(FSX) 


CLKX to internal 
FSX high/low delay 


CLKX ext 


40 


ns 


CLKX int 


21 


(10) 


Isu(FSR) 


FSR setup before CLKR 
low 


CLKR ext 


13 


ns 


CLKR int 


13 


(11) 


th(FS) 


FSX/R input hold from 
CLKX/R low 


CLKX/R ext 


13 


ns 


CLKX/R int 





(12) 


tsu(FSX) 


External FSX setup before 
CLKX 


CLKX ext 


-[tc(H)"8] [t C (SCK/2]-10t 


ns 


CLKX int 


-[t c(H )-21] tcfsCK)^* 


(13) 


1d(CH-DX)V 


CLKX to first DX bit, FSX pre- 
cedes CLKX high delay 


CLKX ext 


45 


ns 


CLKX int 


26 


(14) 


td(FSX-DX)V 7 


FSX to first DX bit, CLKX precedes FSX delay 


45 


ns 


(15) 


td(DXZ) 


CLKX high to DX high impedance following last 
data bit delay 


25t 


ns 



t Guaranteed by design but not tested. 
$ Not tested. 
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Table 13-24. Serial Port Timing Parameters (Figure 13-26 and Figure 13-27) (Continued) 


No. 


Name 


Description 


TMS320C30-33 
TMS320C31-33 


Unit 


Min Max 


(1) 


tdd-H-SCK) 


H1 high to internal CLKX/R delay 


15 


ns 


(2) 


tc(SCK) 


CLKX/R cycle time 


CLKX/R ext 


t c( H)x2.6t 


ns 


CLKX/R int 


*C(H)X2 t c(H )x232t 


(3) 


tw(SCK) 


CLKX/R high/low pulse du- 
ration 


CLKX/R ext 


t c( H)+12t 


ns 


CLKX/R int 


[tc(SCK)/2]-15 [t C (SCK)/2]+5 


(4) 


V(SCK) 


CLKX/R rise time 


8t 


ns 


(5) 


tf(SCK) 


CLKX/R fall time 


8f 


ns 


(6) 


kl(DX) 


CLKX to DX valid delay 


CLKX ext 


35 


ns 


CLKX int 


20 


(7) 


tsu(DR) 


DR setup before 
CLKR low 


CLKR ext 


10 


ns 


CLKR int 


25 


(8) 


th(DR) 


DR hold from 
CLKR low 


CLKR ext 


10 


ns 


CLKR int 





(9) 


td(FSX) 


CLKX to internal 
FSX high/low delay 


CLKX ext 


32 


ns 


CLKX int 


17 


(10) 


4 su(FSR) 


FSR setup before 
CLKR low 


CLKR ext 


10 


ns 


CLKR int 


10 


(11) 


l h(FS) 


FSX/R input hold from 
CLKX/R low 


CLKX/R ext 


10 


ns 


CLKX/R int 





(12) 


tsu(FSX) 


External FSX setup before 
CLKX 


CLKX ext 


-[tc(H)"8] [t c (SCK)/2]-10t 


ns 


CLKX int 


[tc(H)"21] t c(SC K)/2* 


(13) 


td(CH-DX)V 


CLKX to first DX bit, FSX pre- 
cedes 

CLKX high delay 


CLKX ext 


36 


ns 


CLKX int 


21 


(14) 


td(FSX-DX)V 


FSX to first DX bit, CLKX precedes FSX delay 


36 


ns 


(15) 


td(DXZ) 


CLKX high to DX high impedance following last 
data bit delay 


20t 


ns 



t Guaranteed by design but not tested. 
* Not tested. 
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Table 13-24. Serial Port Timing Parameters (Figure 13-26 and Figi 


ire 13-27) (Concluded) 




No. 


Name 


Description 


TMS320C30-40 


Unit 


Min Max 


0) 


tdd-ll-SCK) 


H1 high to internal CLKX/R delay 


13 


ns 


(2) 


tc(SCK) 


CLKX/R cycle time 


CLKX/R ext 


t c ( H )x2.6t 


ns 


CLKX/R int 


tc(H)><2 t c(H )x232* 


(3) 


tw(SCK) 


CLKX/R high/low pulse du- 
ration 


CLKX/R ext 


tc(H)+10t 


ns 


CLKX/R int 


[tc(SCK)/2]-5 [t G (SCK)/2]+5 


(4) 


*r(SCK) 


CLKX/R rise time 


7t 


ns 


(5) 


tf(SCK) 


CLKX/R fall time 


7t 


ns 


(6) 


td(DX) 


CLKX to DX valid delay 


CLKX ext 


30 


ns 


CLKX int 


17 


(7) 


tsu(DR) 


DR setup before 
CLKR low 


CLKR ext 


9 


ns 


CLKR int 


21 


(8) 


th(DR) 


DR hold from 
CLKR low 


CLKR ext 


9 


ns 


CLKR int 





(9) 


td(FSX) 


CLKX to internal 
FSX high/low delay 


CLKX ext 


27 


ns 


CLKX int 


15 


(10) 


t su(FSR) 


FSR setup before 
CLKR low 


CLKR ext 


9 


ns 


CLKR int 


9 


(11) 


*h(FS) 


FSX/R input hold from 
CLKX/R low 


CLKX/R ext 


9 


ns 


CLKX/R int 





(12) 


tsu(FSX) 


External FSX setup before 
CLKX 


CLKX ext 


-[t C (H)-8] [tc(SCK)/2]-10* 


ns 


CLKX int 


4tc(H)-21] t c(S CK)/2* 


(13) 


td(CH-DX)V 


CLKX to first DX bit, FSX pre- 
cedes CLKX high delay 


CLKX ext 


30 


ns 


CLKX int 


18 


(14) 


*d(FSX-DX)V 


FSX to first DX bit, CLKX precedes FSX delay 


30 


ns 


(15) 


td(DXZ) 


CLKX high to DX high impedance following last 
data bit delay 


17t 


ns 



t Guaranteed by design but not tested, 
t Not tested. 
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13.6.14 HOLD Timing 



HOLD is an asynchronous input that can be asserted at any time during a clock 
cycle. If the specified timings are met, the exact sequence shown in 
Figure 1 3-28 will occur; otherwise, an additional delay of one clock cycle may 
occur. 



Table 1 3-25 defines the timing parameters for the HOLD and HOLDA signals. 
The numbers shown in parentheses in Figure 1 3-28 correspond with those in 
the No. column of Table 1 3-25. 

The NOHOLD bit of the prima ry bus c ontrol register (refer to Chapter 7, sub- 
section 7.1.1) overrides the HOLD signal. When this bit is set, the device 
comes out of hold and prevents future hold cycles from occurring. 



Figure 13-28. Timing for HOLD/HOLDA 
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Note: HOLDA will go low in response to HOLD going low and will continue to remain lowuntil one H1 cycle after HOLD goes back 
high as shown in Figure 13-28. 
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Table 13-25. Timing Parameters for HOLD/HOLD A (Figure 13-28) 



No. 


Name 


Description 


TMS320C30-27 
TMS320C31-27 


TMS320C30-33 
TMS320C31-33 


TMS320C30-40 


Unit 


Min 


Max 


Min 


Max 


Min 


Max 


(1) 


Whold) 


HOLD setup before H1 low 


19 


15 


13 


ns 


(3) 


MHOLDA) 


HOLDA valid after H1 low 


Ot 


14 


Ot 


10 


Ot 


9 


ns 


(4) 


tw(HOLD) 


HOLD low duration 


2tc(H) 


2t c (H) 


2t c(H) 


ns 


(6) 


tw(HOLDA) 


HOLDA low duration 


tcH-5t 


tcH-5t 


tcH"5t 


ns 


(7) 


td(H1L-SH)H 


H1 low to STRB high for a HOLD 
delay 


0* 


13 


Ot 


10 


Ot 


9 


ns 


(8) 


tdis(HIL-S) 


H1 low to STRB disabled (high-im- 
pedance state) 


0* 


13t 


ot 


10t 


ot 


9t 


ns 


0) 


*en(H1L-S) 


H1 low to STRB enabled (active) 


0* 


13 


ot 


10 


ot 


9 


ns 


(10) 


tdis(ML-RW) 


H1 low to R/W disabled (high-im- 
pedance state) 


0* 


13t 


ot 


not 


ot 


9t 


ns 


(11) 


ten(H1 L-RW) 


H1 low to R/W enabled (active) 


Ot 


13 


ot 


10 


Ot 


9 


ns 


(12) 


tdis(H1 L-A) 


H1 low to address disabled (high- 
impedance state) 


Ot 


13t 


ot 


10t 


ot 


9t 


ns 


(13) 


ten(HIL-A) 


H1 low to address enabled (valid) 


0* 


19 


ot 


15 


ot 


13 


ns 


(16) 


tdis(HIH-D) 


H1 high to data disabled (high-im- 
pedance state) 


Ot 


13t 


Ot 


10t 


Ot 


9t 


ns 



t Characterized but not tested, 
t Not t ested. 

Note: HOLD is an asynchronous input and can be asserted at any point during a 
exact sequence shown will occur; otherwise, an additional delay of one 



clock cycle. If the specified timings are met, the 
clock cycle may occur. 
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13.6.15 General-Purpose I/O Timing 



Peripheral pins include CLKXO/1 , CLKRO/1 , DXO/1 , DRO/1 , FSXO/1 , FSRO/1 , 
and TCLKO/1. The contents of the internal control registers associated with 
each peripheral define the modes for these pins. 



13.6. 15. 1 Peripheral Pin I/O Timing 



Table 13-26 defines peripheral pin general-purpose I/O timing parameters. 
The numbers shown in parentheses in Figure 1 3-29 correspond with those in 
the No. column of Table 13-26. 



Figure 13-29. Timing for Peripheral Pin General-Purpose I/O 
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Table 13-26. Timing Parameters for Peripheral Pin General-Purpose I/O (Figure 13-29) 



No. 


Name 


Description 


TMS320C30-27 
TMS320C31-27 


TMS320C30-33 
TMS320C31-33 


TMS320C30-40 


Unit 


Min Max 


Min Max 


Min Max 


(1) 


tsu(GPIOHIL) 


General-purpose input setup before 
H1 low 


15 


12 


10 


ns 


(2) 


th(GPIOHIL) 


General-purpose input hold time 
after H1 low 











ns 


(3) 


td(GPIOHIH) 


General-purpose output delay after 
H1 high 


19 


15 


13 


ns 



Note: Peripheral pins include CLKXO/1 , CLKRO/1 , DXO/1 , DRO/1 , FSXO/1 , FSRO/1 , and TCLKO/1 . The modes of these pins are 
defined by the contents of internal control registers associated with each peripheral. 
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13.6. 15.2 Changing the Peripheral Pin I/O Modes 

Table 1 3-27 and Table 1 3-28 show the timing parameters for changing the pe- 
ripheral pin from a general-purpose output pin to a general-purpose input pin 
and vice versa. The numbers shown in parentheses in Figure 13-30 and 
Figure 13-31 correspond to those shown in the No. column of Table 13-27 
and Table 1 3-28. 

Figure 13-30. Timing for Change of Peripheral Pin From General-Purpose Output to Input Mode 
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Table 13-27. Timing Parameters for Peripheral Pin Changing From General-Purpose Output to Input Mode 
(Figure 13-30) 



No. 


Name 


Description 


TMS320C30-27 
TMS320C31-27 


TMS320C30-33 
TMS320C31-33 


TMS320C30-40 


Unit 


Min Max 


Min Max 


Min 


Max 


(1) 


th(H3H) 


Hold after H1 high 


19 


15 


13 


ns 


(2) 


tsu(GPIOHIL) 


Peripheral pin setup before H1 low 


13 


10 


9 


ns 


(3) 


tWGPIOH-IU 


Peripheral pin hold after H1 low 











ns 
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Figure 13-31. Timing for Change of Peripheral Pin From General-Purpose Input to Output Mode 
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Table 13-28. Timing Parameters for Peripheral Pin Changing From General-Purpose Input to Output Mode 
(Figure 13-31) 



No. 


Name 


Description 


TMS320C30-27 
TMS320C31-27 


TMS32QC30-33 
TMS320C31-33 


TMS320C30-40 


Unit 


Min Max 


Min Max 


Min Max 


(1) 


tti(GPIOHIH) 


H1 high to peripheral pin switching 
from input to output delay 


19 


15 


13 


ns 
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1 3.6.1 6 Timer Pin Timing 



Valid logic level periods and polarity are specified by the contents of the internal 
control registers. 

Table 13-29 defines the timing parameters for the timer pin. The numbers 
shown in parentheses in Figure 13-32 correspond with those in the No. col- 
umn of Table 13-29. 



Figure 13-32. Timing for Timer Pin 
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Table 13-29. Timing Parameters for Timer Pin (Figure 13-32) 



No. 


Name 


Description^ 


TMS320C30-27 
TMS320C31-27 


Unit 


Min Max 


(D 


tsu(TCLKHIL) 


TCLK ext setup before H1 low 


TCLK ext 


15 


ns 


(2) 


th(TCLKHIL) 


TCLK ext hold after H1 low 


TCLK ext 





ns 


(3) 


td(TCLKHIH) 


H1 high to TCLK int valid delay 


TCLK int ' 


13 


ns 


(4) 


tc(TCLK) 


TCLK cycle time 


TCLK ext 


t c(H )x2.6t 


ns 


TCLK int 


tc(H)><2 t c(H )x232t 


ns 


(5) 


tw(TCLK) 


TCLK high/low pulse duration 


TCLK ext 


tc(H)+12t 


ns 


TCLK int 


[tcfTCLK)/2]-15 [t G (TCLK)/2]+5 


ns 



t Guaranteed by design but not tested. 

t Timing parameters 1 and 2 are applicable for a synchronous input clock. Timing parameters 4 and 5 are applicable for an 
asynchronous input clock. 
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Table 13-29. Timing Parameters for Timer Pin (Figure 13-32) (Continued) 



No. 


Name 


Description^ 


TMS320C30-33 
TMS320C31-33 


Unit 


Min Max 


(D 


tsu(TCLKHIL) 


TCLK ext setup before H1 low 


TCLK ext 


12 


ns 


(2) 


th(TCLKH1 L) 


TCLK ext hold after H1 low 


TCLK ext 





ns 


(3) 


td(TCLKHIH) 


H1 high to TCLK int valid delay 


TCLK int 


10 


ns 


(4) 


tc(TCLK) 


TCLK cycle time 


TCLK ext 


tc(H)x2.6t 


ns 


TCLK int 


tc(H)><2 t c(H )x232t 


ns 


(5) 


tw(TCLK) 


TCLK high/low pulse duration 


TCLK ext 


tc(H)+12t 


ns 


TCLK int 


[tc(TCLK^]-15 [t c (TCLKV2]+5 


ns 



t Guaranteed by design but not tested. 

t Timing parameters 1 and 2 are applicable for a synchronous input clock. Timing parameters 4 and 5 are applicable for an 
asynchronous input clock. 

Table 13-29. Timing Parameters for Timer Pin (Figure 13-32) (Continued) 



No. 


Name 


Description^ 


TMS320C30-40 


Unit 


Min 


Max 


(1) 


tsu(TCLKHIL) 


TCLK ext setup before H1 low 


TCLK ext 


10 


ns 


(2) 


th(TCLKHIL) 


TCLK ext hold after H1 low 


TCLK ext 





ns 


(3) 


td(TCLKHIH) 


H1 high to TCLK int valid delay 


TCLK int 


9 


ns 


(4) 


tc(TCLK) 


TCLK cycle time 


TCLK ext 


t c( H)x2.6t 


ns 


TCLK int 


*c(H)x2 


t c(H )X232t 


ns 


(5) 


tw(TCLK) 


TCLK high/low pulse duration 


TCLK ext 


t c(H )+10t 


ns 


TCLK int 


[tc(TCLK)/2]-5 


[tc(TCLK)/2]+5 


ns 



t Guaranteed by design but not tested. 

t Timing parameters 1 and 2 are applicable for a synchronous input clock. Timing parameters 4 and 5 are applicable for an 
asynchronous input clock. 
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Instruction Opcodes 



The opcode fields for all the TMS320C3x instructions are shown in Table A-1 . 
Bits in the table marked with a hyphen are defined in the individual instruction 
description (see Chapter 11). Table A-1, along with the instruction descrip- 
tions, fully defines the instruction words. The opcodes are listed in numerical 
order. 



Table A-1. TMS320C3x Instruction Opcodes 



INSTRUCTION 


31 


30 


29 


28 


27 


26 


25 


24 


23 


ABSF 





























ABSI 


























1 


ADDC 























1 





ADDF 























1 


1 


ADDI 




















1 








AND 




















1 





1 


ANDN 




















1 


1 





ASH 




















1 


1 


1 


CMPF 




























CMPI 

























1 


FIX 






















1 





FLOAT 






















1 


1 


IDLE 



















1 








LDE 



















1 





1 


LDF 



















1 


1 





LDFI 



















1 


1 


1 


LDI 




























LDII 

























1 


LDM 






















1 





LSH 






















1 


1 


MPYF 



















1 








MPYI 



















1 





1 


NEGB 



















1 


1 





NEGF 



















1 


1 


1 


NEGI 
















1 
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Table A-1. TMS320C3x Instruction Opcodes (Continued) 



INSTRUCTION 


31 


30 


29 


28 


27 


26 


25 


24 


23 


NOP 
























1 


NORM 





















1 





NOT 





















1 


1 


POP 


















1 








POPF 


















1 





1 


PUSH 


















1 


1 





PUSHF 


















1 


1 


1 


OR 




























RND 






















1 





ROL 






















1 


1 


ROLC 



















1 








ROR 



















1 





1 


RORC 



















1 


1 





RPTS 



















1 


1 


1 


STF 



























STFI 
























1 


STI 





















1 





STII 





















1 


1 


SIGI 


















1 








SUBB 


















1 





1 


SUBC 


















1 


1 





SUBF 


















1 


1 


1 


SUBI 



























SUBRB 
























1 


SUBRF 





















1 





SUBRI 





















1 


1 


TSTB 


















1 








XOR 


















1 





1 


IACK 


















1 


1 





ADDC3 




























ADDF3 

























1 


ADDI3 






















1 





AND3 






















1 


1 


ANDN3 



















1 








ASH3 



















1 





1 


CMPF3 



















1 


1 





CMPI3 



















1 


1 


1 
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Table A-1. TMS320C3x Instruction Opcodes (Continued) 



INSTRUCTION 


31 


30 


29 


28 


27 


26 


25 


24 


23 


LSH3 



























MPYF3 
























1 


MPYI3 





















1 





OR3 





















1 


1 


SUBB3 


















1 








SUBF3 


















1 





1 


SUB13 










0, 







1 


1 





TSTB3 


















1 


1 


1 


XOR3 













1 














LDFcond 













- 


- 


- 


- 


- 


LD\cond 










1 


- 


- 


- 


- 


- 


BR(D)t 





















- 


- 


CALL 


















1 


- 


- 


RPTB 















1 





- 


- 


SWI 















1 


1 


- 


- 


Bcond(D)~t 












1 





- 


- 


- 


DBcond{D) t 












1 


1 


- 


- 


- 


CALlcond 









1 








- 


- 


- 


TRAPcond 









1 





1 





- 


- 


RETI cond 









1 


1 














RETS cond 









1 


1 











1 


MPYF3||ADDF3 





































1 
1 




1 



1 


- 


MPYF3||SUBF3 




























1 
1 
1 
1 





1 
1 




1 



1 


- 


MPY!3||ADDI3 






















1 
1 
1 
1 











1 
1 




1 



1 


- 



t Opcode same for standard and delayed instructions. 
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Table A-1. TMS320C3x Instruction Opcodes (Concluded) 



INSTRUCTION 


31 


30 


29 


28 


27 


26 


25 


24 


23 


MPYI3||SUBI3 






















1 
1 
1 
1 


1 
1 
1 
1 





1 
1 




1 



1 


- 


STF||STF 





















- 


- 


STI||STI 


















1 


- 


- 


LDF||LDF 















1 





- 


- 


LDI||LDI 















1 


1 


- 


- 


ABSFHSTF 












1 








- 


- 


ABSI||STI 












1 





1 


- 


- 


ADDF3||STF 












1 


1 





- 


- 


ADDI3||STI 












1 


1 


1 


- 


- 


AND3||STI 




















- 


- 


ASH3||STI 

















1 


- 


- 


FIX||STl 














1 





- 


- 


FLOAT||STF 














1 


1 


- 


- 


LDF||STF 











1 








- 


- 


LDI||STI 











1 





1 


- 


- 


LSH3||STI 











1 


1 





- 


- 


MPYF3||STF 











1 


1 


1 


- 


- 


MPYI3||STI 




















- 


- 


NEGF||STF 

















1 


- 


- 


NEGI||STI 














1 





- 


- 


NOT||STI 














1 


1 


- 


- 


OR3||STI 











1 








- 


- 


SUBF3||STF 











1 





1 


- 


- 


SUBI3||STI 











1 


1 





- 


- 


XOR3||STI 











1 


1 


1 


- 


- 


Reserved for reset, 
traps, and interrupts 





1 


1 


1 


1 


1 


1 


1 


1 
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Introduction 



Architectural Overview 



GPU Registers, Memory, and Cache 



Data Formats and Floating-Point Operation 



Addressing 



Program Flow Control 



External Bus Operation 



Peripherals 



Pipeline Operation 



Assembly Language Instructions 



Software Applications 



Hardware Applications 



TMS320C3x Signal Description and Electrical Characteristics 
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Development Support/Part Order Information 



Quality and Reliability 



Calculation of TMS320C30 Power Dissipation 



SMJ320C30 Digital Signal Processor Data Sheet 



Quick Reference Guide 
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Appendix B 

Development Support/Part Order Information 



This appendix provides development support information, device part num- 
bers, and support tool ordering information for the TMS320C3x generation. 
Figure B-1 shows the software and hardware development tools available and 
the development environment for the TMS320C3x. 



Figure B-1. TMS320C3x Development Environment 
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B.1 TMS320C3x Development Support 

Texas Instruments supplies extensive development support and complete 
documentation for the TMS320C3x generation of digital signal processors. Tl 
also offers a complete line of software and hardware tools (shown in Table B-2 
on page B-1 5) to support in application development, evaluation of the proces- 
sor performance, algorithm implementations, and full integration of design 
modules. 

Software development tools include 

Q TMS320C3x Macro Assembler/Linker 

Ql TMS320C3x Optimizing ANSI C Compiler 

Q TMS320C3x Simulator 

Q SPOX (the TMS320C3x Operating System). 

Hardware development tools consist of 

□ TMS320C30 Evaluation Module (EVM) 
Q TMS320C3x XDS500 Emulator 

Q TMS320C30 XDS1 000 Development System 

□ TMS320C30 Hewlett-Packard 64776 Analysis Subsystem 

The macro assembler/linker converts assembly language into executable ob- 
ject code. The TMS320C3x optimizing C compiler supports high-level lan- 
guage programming and is a full implementation of the standard ANSI C lan- 
guage. The simulator is a software program that simulates nonreal-time opera- 
tion of the TMS320C3x, allowing verification and monitoring of the state of the 
processor. 

Both the TMS320C3X XDS500 Emulator and the TMS320C30 XDS1 000 De- 
velopment Environment provide full-speed, in-circuit emulation for 
TMS320C3x system design and debug on the IBM PC/AT and compatible de- 
vices. The TMS320C3x XDS500 supports hardware and software debug of 
your target system. The TMS320C30 XDS1000 provides system needs from 
concept to prototype. It includes the XDS500 Emulator Board and the 
TMS320C30 Application Board (a predefined sample target system that con- 
tains a TMS320C30). 

These hardware and software products are easy to use and offer the designer 
the tools needed to significantly reduce TMS320C3x system development time 
and cost. 

A description of key features for each TMS320C3x development support tool 
is provided in the following subsections. For ordering information, see Section 
B.4. For detailed information on these tools, refer to TMS320 Family Develop- 
ment Support Reference Guide (literature number SPRU01 1 B). Call the Cus- 
tomer Response Center at 800-336-5236 to request a copy. 
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B.1.1 Macro Assembler/Linker 



The TMS320C3x Macro Assembler/Linker is a software tool that converts 
source mnemonics to executable object code. It is distinguished by these key 
features: 

□ Macro capabilities and library functions 

Qi Conditional assembly 

Q Relocatable modules 

Ql Complete error diagnostics 

Qi Symbol table and cross-reference 

To address specific needs, the TMS320C3x Macro Assembler/Linker is 
shipped with four programs: 

1 ) The assembler translates assembly language source files into machine 
language object files. 

2) The archiver collects a group of files — object, source, or macros — into a 
single archive file. 

3) The linker combines object files into a single executable object module. 

4) The object format converter changes the object file into Intel, Tektronix, 
or Tl-tagged object format. The converted file can be downloaded to an 
EPROM programmer; the EPROM code can then be executed on the 
TMS320C3x device. 

The main purpose of this development process, shown in Figure B-1 , is to pro- 
duce a module that can be executed in a system that contains a TMS320C3x 
device or the software or hardware development tools. 

The macro assembler/linker is currently available for PC/MS-DOS (versions 
3.0 and up) and OS/2, Macintosh MPW, VAX VMS, SUN-3, and SUN-4 UNIX 
operating systems. 



B.1 .2 Optimizing ANSI C Compiler 



The Tl C compiler translates the widely used ANSI C language directly into 
highly optimized assembly code. This code is then assembled and linked using 
Tl's assembler/linker, which is shipped with the compiler. 

The C compiler provides for enhanced productivity by enabling the application 
designer to program in C, thus making code easier to prototype, debug, and 
benchmark. Furthermore, already existing code can be directly compiled and 
executed on a TMS320C3x. 
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The TMS320C3x Optimizing ANSI C Compiler is a full-featured C compiler. 
Compiler features include 

Qi Complete and exact conformity to the ANSI C specification. 

Ql Highly efficient code. The compiler incorporates state-of-the-art generic 
and target-specific optimizations. 

Q C programs that can be linked with assembly language routines, allowing 
hand coding of time-critical functions in TMS320C3x assembly language. 

Qi ANSI-standard run-time library. 

□ AC shell program to facilitate one-step translation from C source to ex- 
ecutable code. 

Qi A variety of listing files. 

Qi Fast compilation to increase productivity. 

Qi Complete and useful diagnostics (error messages). 

ti Validation with the de facto industry standard Plum Hall and Perennial vali- 
dation suites. 

Below are key optimizations performed by the compiler. 

£1 TMS320C3x-specific optimations 

■ Register variables 

■ Register tracking/targeting 

■ Cost-based register allocation 

■ Autoincrement addressing modes 

■ Repeat blocks 

■ TMS320C3x parallel instructions 

■ Conditional instructions 

■ TMS320C3x delayed branches 

Qi General-purpose C optimizations 

■ Algebraic reordering/symbolic simplification/constant folding 

■ Data flow optimizations 

■ Copy propagation 

■ Common subexpression elimination 

■ Redundant assignment elimination 

■ Alias disambiguation 

■ Branch optimizations/controlled-flow simplification 

■ Loop induction variable optimizations/strength reduction 

■ Loop unrolling 

■ Loop rotation 
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B.1.3 Simulator 



TMS320C3x Development Support 



& Loop-invariant code motion 

n In-line expansion of run-time support library functions 

The compiler supports DEC VAX/VMS, IBM-PC with PC-DOS' (versions 3.0 
and up) or OS/2 compatibles, Macintosh MPW, and SUN-3 and SUN-4 UNIX 
systems. 

The assembler/linker is included with the shipment of the C compiler. 



The TMS320C3x Simulator is a software program that simulates operation of 
the TMS320C3x. These key features make the simulator effective and flexible 
in TMS320C3x software development: 

□ Simulation of the entire TMS320C3x digital signal processor instruction 
set 

Q Simulation of the key TMS320C3x peripheral features (DMA, timers, and 
serial port) 

Ql Command entry eitherfrom menu-driven keystrokes (menu mode) or from 
a batch file (line mode) 

□ Help menus for all screen modes 

Qi Standard interface that can be user customized 

Qi Quick storage and retrieval of simulation parameters from files to facilitate 
preparation for individual sessions 

□l Reverse assembly that allows editing and reassembly of source state- 
ments 

Qi Memory contents that can be displayed in hexadecimal 32-bit values and 
assembled source at the same time. 

Q Execution modes including single-step, until, while, for, and run to break- 
point or user halt. 

□l Breakpoints 

Qi Simulation of cache utilization 

Q Cycle counting 

The simulator allows verification and monitoring of the state of the processor. 
Simulation occurs at thousands of instructions per second (VAX/VMS and 
SUN-3/SUN-4 UNIX) or hundreds of instructions per second (PC/MS-DOS). 

The user interface in the simulator is identical to that in the XDS. See 
Figure B-2 for an example. 
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Figure B-2. TMS320C3x Simulator User Interface 
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The simulator currently supports PC/MS-DOS, VAX VMS, and VAX Ultrix oper- 
ating systems, and SUN-3 and SUN-4 UNIX systems. 



B.1 .4 The TMS320C3x Operating System (SPOX) 



SPOX, developed by Spectron Microsystems Inc., is the industry's first hard- 
ware-independent software base for a real-time DSP operating system. SPOX 
features a set of high-level C-callable software functions, which are indepen- 
dent of the underlying hardware platform, thus insulating real-time DSP appli- 
cations from many low-level system details. 

SPOX differs from other operating systems or real-time kernels (such as UNIX) 
just as the TMS320C3x differs from a general-purpose microprocessor: both 
SPOX and the TMS320C3x are application-specific. SPOX affords its users 
two important benefits: software productivity and application portability. 

Functional Components 

The SPOX software interface augments high-level programming languages 
like C by accessing a virtual DSP machine that consists of four functional com- 
ponents: 

1 ) DSP MATH furnishes application software with a rich set of operations 
used to manipulate vectors, matrices, and filters. 
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2) Hierarchical Memory Management gives the application explicit control in 
allocating data storage from different segments of system memory. 

3) Stream I/O presents a device-independent application interface used to 
input and outputs blocks of data from a variety of peripherals. 

4) Realtime Keme/provides primitives for scheduling and synchronizing mul- 
tiple, prioritized tasks. 

Figure B-3 illustrates the functional components of SPOX. 
Figure B-3. Internal SPOX Architecture 
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SPOX Product Offering 

The SPOX software interface is supported in different execution environments, 
including Sun workstations, IBM personal computers, and VAX minicomput- 
ers. On these host systems, DSP application programs written in high-level 
languages such as C can be developed and debugged in familiar software en- 
gineering environments equipped with powerful tools and utilities. Afterward, 
these same programs can be recompiled with the TMS320C3x C compiler 
available for these same hosts, then benchmarked on the TMS320C3x soft- 
ware simulator for time and space using a version of SPOX designed specifi- 
cally for use with the TMS320C3x simulator or a hardware development sys- 
tem such as the XDS1 000. 

Spectron also offers a SPOX Porting Kit. This product includes an unbundled 
version of SPOX, whose generic components can be configured for the specif- 
ic TMS320C3x application and integrated with system-dependent software 
(drivers, math functions, etc.) supplied by the developer. 

SPOX is currently packaged with the TMS320C3x XDS1 000 Development En- 
vironment. For more information regarding SPOX, contact Spectron Microsys- 
tems at (805) 967-0503. 
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B.1.5 TMS320C3X Evaluation Module 

With the introduction of the TMS320C30 Evaluation Module (EVM), Texas In- 
struments has removed the cost barrier to evaluating and developing floating- 
point DSP applications. The TMS320C30 EVM is the first floating-point DSP 
tool that bridges the price-performance gap between software simulators and 
full-featured development platforms. 

Each EVM comes complete with a PC halfcard and software package. The 
EVM board contains 

Q One TMS320C30, a 33-MFLOP, 32-bit floating-point DSP 

Q 16K-word zero-wait-state SRAM, allowing coding of most algorithms di- 
rectly on the board 

Qi A speaker/microphone-ready analog interface for multimedia, speech, 
and audio applications development 

□ A multiprocessor serial port interface for connecting to multiple EVMs 

Qi A host port for PC communications 

The system also comes with all the software required to begin applications de- 
velopment on a PC host. Equipped with a C and assembly language source 
level debugger for DSP, the EVM has a window-oriented, mouse-driven inter- 
face that enables the downloading, executing, and debugging of assembly 
code or C code. See subsection B.1 .6, Emulator User interface, for more infor- 
mation. 

The TMS320C3x Assembler/Linker is also included with the EVM. For users 
who prefer programming in a high-level language, an optimizing ANSI C com- 
piler and Ada compiler are offered separately. 

B.1 .6 TMS320C3x Emulator — Extended Development System (XDS500 and 
XDS1000) 

The TMS320C3x XDS500 and XDS1000 Emulators are user-friendly, PC- 
based development systems, which provide all the features necessary to per- 
form full-speed in-circuit emulation with the TMS320C3x. These emulators al- 
low you to develop software and hardware and to integrate the software and 
hardware with the target system. A revolutionary scan path interface gives con- 
trol and access to every memory location and register of the TMS320C3x. Key 
features of the TMS320C3x emulators include 

Qi no cable length/transmission problems, 

Q a nonintrusive system, 

Q no loading problems on signals, 
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Qi no artificial memory limitations, 

Qi a common screen interface for ease of use, 

Qi easy installation, 

Q in-system emulation, 

Q no variance from data sheet. 

Full-speed execution and monitoring of the target system is performed by the 
4-wire interface or scan path via a 1 2-pin target connector. This scan path con- 
trols the TMS320C3x in the target application and provides access to all the 
registers as well as to internal and external memory of the device. Since pro- 
gram execution takes place on the TMS320C3x in the target system, there are 
no timing differences during emulation. This new design offers significant ad- 
vantages over traditional emulators: 

Qi no cable length/transmission problems, 

Qi a nonintrusive system, 

Q no loading problems on signals, 

tk no artificial memory limitations, 

ti a common screen interface for ease of use, 

Q easy installation, 

£k in-system emulation, 

Qi no variance from data sheet. 

The 12-pin target connector allows for emulation of multiprocessing applica- 
tions. For example, if five TMS320C3xs exist on one board, as shown in 
Figure B-4, each device is emulated by simply moving the 12-pin target con- 
nector from one TMS320C3x connector to the next. Real-time emulation is still 
maintained, and the information of each processor is preserved. 



Figure B-4. Multiprocessing Emulation 
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The TMS320C3x XDS1 000 is a full-speed emulator that comes with a prebuilt 
target system for early design development. The TMS320C3x XDS1 000 can 
help debug hardware in real time, debug software in real time, and integrate 
the hardware and software together. 

The XDS500 and XDS1 000 are available on an IBM PC/AT or compatible ma- 
chine with 640K of memory running PC/MS-DOS. The XDS500 is shipped with 
the macro assembler/linker. The XDS1000 is shipped with the optimizing C 
compiler, macro assembler/linker, and SPOX. 

Emulator User Interface 

Included with the XDS500 and XDS1000 systems is the C/assembly source 
debugger and a new menu-, mouse-, and window-oriented environment that 
is the standard for all TMS320 DSP interfaces. The user-friendly, state-of-the- 
art interface is flexible and easily customized for color display or monochrome 
monitors. Its features include 

Qi Fields that can be edited through the point-and-click capability of the 
mouse. 

□ Menus that provide a quick and easy alternative to the keyboard. 

Qi Resize and drag capabilities that allow you to define window size and loca- 
tion and to view as much information as you need. 

Q Smart displays that reconfigure the format of displayed data to fit window 
size and shape. 

Q Highlighted fields whenever program execution changes the field values. 

Q A command that allows you to save the screen configurations. 

Q Eight types of windows for debugging and configuring an environment. 
The windows can be in one of the modes described above or can be in any 
user-defined combination of up to 120 windows. The windows provided 
are the same as the simulator, shown in Figure B-2, and include 

a) Command Window for entering commands and displaying output 
and error messages. 

b) Memory Window for displaying, viewing, and editing contents of 
memory. 

c) Disassembly Window for displaying disassembled code. 

d) File Window for displaying the contents of any text file. 

e) CPU Window for displaying, viewing, and editing the CPU registers. 

f) Calls Window for displaying the current function call that a C program 
has made. 
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g) Watch Window for displaying values of selected variables, registers, 
or other C expressions. 

h) Display Windows for displaying all field elements of a selected struc- 
ture or array. 



B.1.7 Hewlett-Packard 64700 Analysis Subsystem 



The HP 64700 TMS320C30 Analysis Subsystem is an analysis tool that can 
be used with the TMS320C30 XDS500/1000 emulators to capture 
TMS320C30 bus cycle information in real time. The subsystem collects trace 
samples during a bus cycle and stores them into a trace buffer, which can be 
viewed for analysis and debugging. 

An enhanced version of the Tl user-friendly, windowed C/Assembly Source 
Level Debugger (the current emulator interface) is included with the subsys- 
tem. The debugger's basic feature set has been extended with additional win- 
dows and commands to provide access to the logic analysis capabilities of the 
subsystem. 

The analysis subsystem features are integrated into the window-driven C/as- 
sembly source debugger. This means the currentTMS320C30 developer does 
not need to learn a new interface or new software to take advantage of the sub- 
system's enhanced feature set. The interface utilizes the debugger's symbol 
table information so that trace information can be displayed in either assembly, 
C, or both simultaneously, further reducing debug time. 

The key features of the HP 64700 Subsystem include 

Qi Hardware breakpoints 

Ql Selectable tracing on the primary or expansion bus 

Ql Up to 1 024-deep trace buffer 

tl Two-deep prestore buffer 

Ql Flexible triggering 

£k Complete timing analysis 

Q Comprehensive display 

Q Action on external signals 
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B.1.8 TMS320 Third Parties 



The TMS320 family is supported by product and service offerings from more 
than 1 00 independent vendors and consultants, known as third parties. These 
support products take various forms (both software and hardware) from cross- 
assemblers, simulators, and DSP utility packages to logic analyzers and emu- 
lators. The expertise of those involved in support services ranges from speech 
encoding and vector quantization to software/hardware design and system 
analysis. 

For a more detailed description of services and products offered by third par- 
ties, referto the TMS320 Third Party Support Reference Guide (literature num- 
ber SPRU052). Call the Customer Response Center at 800-336-5236 to re- 
quest a copy. 
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B.2 TMS320 Literature/DSP Hotline/Bulletin Board Services 

Extensive DSP documentation is available; this includes data sheets, user's 
guides, and application reports. In addition, DSP textbooks that aid research 
and education have been published by Prentice-Hall, John Wiley and Sons, 
and Computer Science Press. To order literature or to subscribe to the DSP 
newsletter "Details on Signal Processing" (for up-to-date information on new 
products and services), call the Tl Customer Response Center (CRC) at (800) 
336-5236. 

For answers to TMS320 technical questions on device problems, development 
tools, documentation, upgrades, and new products, call the Tl DSP Hotline at 
(713) 274-2320 Monday-Friday from 8:00 a.m. to 6:00 p.m. central time. To 
ask about third-party applications and algorithm development packages, con- 
tact the third party directly. Refer to the TMS320 Third-Party Support Refer- 
ence Guide (SPRU052) for addresses and phone numbers. 

For information on 

□ TMS320 devices, 

Qi Specification updates for current or new devices and development tools, 

Q Development tool and silicon revisions and enhancements, 

Qi New DSP application software as it becomes available, and 

□ Source code from the TMS320C3x User's Guide, 

call the TMS320 DSP Bulletin Board Service (BBS). You can access this tele- 
phone-line computer service by dialing (713) 274-2323 on a 300-, 1200-, or 
2400-bps modem. To find out more about the BBS, look in the TMS320 Family 
Development Support Reference Guide (literature number SPRU011 B). 

Contact the nearest Tl Field Sales Office for prices and availability of TMS320 
devices and support tools. See the list of sales offices and distributors at the 
end of this book. 
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B.3 Technical Training Organization (TTO) TMS320 Workshops 

To register, contact the nearest Tl RTC. A 15-percent discount is available 
when three or more engineers from the same company enroll in the same work- 
shop. To enroll, call 800-336-5236, ext. 3904. 

B.3.1 TMS320C3x Design Workshop 

The TMS320C3x DSP Design Workshop is tailored for hardware and software 
design engineers and decision-makers who will be designing and utilizing the 
TMS320C3x generation of DSP devices. Hands-on exercises throughout the 
course give the designer a rapid start in utilizing TMS320C3x design skills. Mi- 
croprocessor/assembly language experience is required. Experience with dig- 
ital design techniques and C language programming experience is desirable. 
These topics are covered in the TMS320C3x workshop: 

□i TMS320C3X architecture/instruction set 

Q Use of the PC-based TMS320C3x software simulator and EVM 

Qi Floating-point and parallel operations 

□ Use of the TMS320C3x assembler/linker 
Qi C programming environment 

Q System architecture considerations 

□ Memory and I/O interfacing 

Q TMS320C3x development support 
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B.4 TMS320C3x Part Order Information 



This section provides the device and support tool part numbers. Table B-1 lists 
the part numbers for the TMS320C30 and TMS320C31 , and Table B-2 gives 
ordering information for TMS320C3x hardware and software support tools. An 
explanation of the TMS320 family device and development support tool prefix 
and suffix designators follows the two tables to assist in understanding the 
TMS320 product numbering system. 



Table B-1. TMS320C3x Digital Signal Processor Part Numbers 



Device 


Technology 


Operating 
Frequency 


Package 
Type 


Typical Power 
Dissipation 


TMS320C30GBL 


1.0-UmCMOS 


33 MHz 


Ceramic 181 -pin PGA 


1.00 w 


TMS320C30GBL27 


1.0-UmCMOS 


27 MHz 


Ceramic 181 -pin PGA 


0.88 W 


TMS320C30GBL40 


1.0-UmCMOS 


40 MHz 


Ceramic 188-pin PGA 


1.25 W 


TMS320C31 PQL 


0.8-Um CMOS 


33 MHz 


Plastic 1 32-pin QFP 


0.75 W 


TMS320C31 PQL27 


0.8-U.m CMOS 


27 MHz 


Plastic 1 32-pin QFP 


0.63 W 


SMJ320C30GBM28 
SMJ320C30HUM28 
SMJ320C30HTM28 


1 .0-|im CMOS 


28 MHz 


Ceramic 181 -pin PGA 
or Ceramic 196-pin QFP 


1.00 W 
1.00 W 


SMJ320C30GBM25 
SMJ320C30HUM25 
SMJ320C30HTM25 


1 .0-(lm CMOS 


25 MHz 


Ceramic 181 -pin PGA 
or Ceramic 1 96-pin QFP 


1.00 W 
1.00 W 



Table B-2. TMS320C3x Support Tool Part Numbers 



Tool Description 


Operating System 


Part Number 


Software 


C Compiler & Macro Assembler/ Linker 


VAXVMS 
PC-DOS/MS-DOS 
SUN UNIX t 
VAX Ultrix 
MAC-MPW 


TMDS3243255-08 
TMDS3243855-02 
TMDS3243555-08 
TMDS3243265-08 
TMDS3243565-01 


Evaluation Module (EVM) 


PC-DOS/MS-DOS 


TMDX3260030 


HP Trace Analyzer 


PC-DOS/MS-DOS 


TMDX326HP30 



t Note that SUN UNIX supports TMS320C3x software tools on the 68000 family-based SUN-3 series 
workstations and on the SUN-4 series machines that use the SPARC processor, but not on the 
SUN-386i series of workstations. 

* SPOX is currently packaged with XDS1000 Development Environment. 
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Table B-2. TMS320C3x Support Tool Part Numbers (Concluded) 



Tool Description 


Operating System 


Part Number 


Software (Concluded) 


Macro Assembler/Linker 


VAX VMS 

PC-DOS/MS-DOS; OS/2 
SUN UNIX t 
MAC-MPW 


TMDS3243250-08 
TMDS3243850-02 
TMDS3243550-08 
TMDS3243560-01 


Operating System (SPOX) 


PC-DOS/MS-DOS* 


Offered by Spectron 
Inc. (805) 967-0503 


Simulator 


VAX VMS 
PC-DOS/MS-DOS 
SUN UNIXt 


TMDS3243251-08 
TMDS3243851-02 
TMDS3243551-09 


Hardware 


XDS500 Emulator 


PC/MS-DOS 


TMDS3260131 


XDS1000 Development Environment 


PC/MS-DOS 


TMDS3261030 



t Note that SUN UNIX supports TMS320C3x software tools on the 68000 family-based SUN-3 se- 
ries workstations and on the SUN-4 series machines that use the SPARC processor, but not on 
the SUN-386i series of workstations. 

* SPOX is currently packaged with XDS1000 Development Environment. 

B.4.1 Device and Development Support Tool Prefix Designators 

Prefixes to Texas Instruments' part numbers designate phases in the product's 
development stage for both devices and support tools, as shown in the follow- 
ing definitions: 

Device Development Evolutionary Flow: 

TMX Experimental device that is not necessarily representative of the final 
device's electrical specifications. 

TMP Final silicon die that conforms to the device's electrical specifications 
but has not completed quality and reliability verification. 

TMS Fully qualified production device. 
Support Tool Development Evolutionary Flow: 

TMDX Development support product that has not yet completed Texas Instru- 
ments' internal qualification testing for development systems. 

TMDS Fully qualified development support product. 

TMX and TMP devices and TMDX development support tools are shipped with 
the following disclaimer: 

"Developmental product is intended for internal evaluation purposes." 
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TMS320C3x Part Order Information 



Note: Prototype Devices 

Texas Instruments recommends that prototype devices (TMX or TMP) not be 
used in production systems because their expected end-use failure rate is 
undefined but predicted to be greater than standard qualified production de- 
vices. 



TMS devices and TMDS development support tools have been fully character- 
ized, and their quality and reliability have been fully demonstrated. Texas In- 
struments' standard warranty applies to TMS devices and TMDS development 
support tools. 

TMDX development support products are intended for internal evaluation pur- 
poses only. They are covered by Texas Instruments' Warranty and Update 
Policy for Microprocessor Development Systems products; however, they 
should be used by customers only with the understanding that they are devel- 
opmental in nature. 
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B.4.2 Device Suffixes 



The suffix indicates the package type (e.g., N, FN, or GB) and temperature 
range (e.g., L). 

Figure B-5 presents a legend for reading the complete device name for any 
TMS320 family member. 



Figure B-5. TMS320 Device Nomenclature 



TMS 320 C 30 GB 



Prefix 



TMX= Experimental Device 
TMP= Prototype Device 
TMS= Qualified Device 
SMJ= MIL-STD-883C 



Device Family- 



320 = TMS320 Family 



Technology 

C = CMOS 

E = CMOS EPROM 

No Letter = NMOS 



Device 

1st-generation DSP: 

10 

14 

15 

16 

17 
2nd-generation DSP: 

20 

25 

26 
3rd-generation DSP: 

30 

31 
4th-generation DSP: 

40 
5th-generation DSP: 

50 

51 



L Temperature Range 

H = to 50°C 
L = 0to70°C 
S = -55 to 1 00°C 
M= -55to125°C 
A = -40 to 85°C 



Package Type 



N = Plastic DIP 

JD = Ceramic DIP Side-Brazed 

FN = Plastic Leaded CC 

GB = Ceramic PGA 

FJ = Ceramic Leaded CC 

FD = Leadless Ceramic CC 

FZ = Ceramic Leaded CC 

GE = Ceramic PGA, Glass Seal 

HU = Ceramic quad flatpack 

HT = Ceramic quad flatpack 

(gull wing) 
PQ = Plastic quad flatpack 
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Chapter C 

Quality and Reliability 



The quality and reliability of Texas Instruments microprocessor and 
microcontroller products, which include TMS320 digital signal processors, re- 
lies on feedback from 

Ql Our customers, 

Ql Our total manufacturing operation from front-end wafer fabrication to final 
shipping inspection, and 

Ql Product quality and reliability monitoring. 

Our customer's perception of quality is the governing criterion forjudging per- 
formance. This concept is the basis for Texas Instruments Corporate Quality 
Policy, which is as follows: 

"For every product or service we offer, we shall define the requirements that 
solve the customer's problems, and we shall conform to those requirements 
without exception." 

Texas Instruments has developed a leadership reliability qualification system, 
based on years of experience with leading-edge memory technology and on 
years of research into customer requirements. In orderto achieve constant im- 
provement, programs that support that system respond to customer input and 
internal information. 
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C.1 Reliability Stress Tests 

Accelerated stress tests are performed on new semiconductor products and 
process changes in order to qualify them and to ensure excellence in product 
reliability. These test environments are typical: 

□l High-temperature operating life 

Q Storage life 

Qi Temperature cycling 

Ql Biased humidity 

Qi Autoclave 

Qi Electrostatic discharge 

Qi Package integrity 

Q Electromigration 

Q Channel-hot electrons (performed on geometries less than 2.0 urn). 

Typical events or changes that require internal requalification of a product in- 
clude 

Q New die design, shrink, or layout 

□ Wafer process (baseline/control systems, flow, mask, chemicals, gases, 
dopants, passivation, or metal systems) 

Q Packaging assembly (baseline control systems or critical assembly equip- 
ment) 

Q Piece parts (such as lead frame, mold compound, mount material, bond 
wire, or lead finish) 

Qi Manufacturing site 

Tl reliability control systems extend beyond qualification. Total reliability con- 
trols and management include product reliability monitoring as well as final 
product release controls. MOS memories, utilizing high-density active ele- 
ments, serve as the leading indicator in wafer-process integrity at Tl MOS fabri- 
cation sites, enhancing all MOS logic device yields and reliability. Tl places 
more than several thousand MOS devices per month on reliability tests to en- 
sure and sustain built-in product excellence. 

Table C-1 lists the microprocessor and microcontroller reliability tests, the du- 
ration of the test, and sample size. Definitions and descriptions of those tests 
precede the table. 
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AOQ (Average Outgoing Quality) 
FIT (Failure In Time) 

Operating Lifetest 



High-Temperature Storage 
Biased Humidity 

Autoclave (Pressure Cooker) 



Temperature Cycle 



Electrostatic Discharge 



Thermal Shock 



PIND(Particle Impact Noise 
Detection) 

Mechanical Sequence: 

Fine and gross leak 
Mechanical shock 

PIND (optional) 



Amount of defective product in a population, usually 
expressed in terms of parts per million (PPM). 

Estimated field failure rate in number of failures per 
billion power-on device hours; 1 000 FITS equal 0.1 
percent failure per 1000 device hours. 

Device dynamically exercised at a high ambient 
temperature (usually 125 °C) to simulate field 
usage that would expose the device to a much lower 
ambient temperature (such as 55 °C). Using a 
derived high temperature, a 55°C ambient failure 
rate can be calculated. 

Device exposed to 150 °C unbiased condition. 
Bond integrity is stressed in this environment. 

Moisture and bias used to accelerate corrosion- 
type failures in plastic packages. Conditions include 
85 °C ambient temperature with 85 percent relative 
humidity (RH). Typical bias voltage is +5 V and is 
grounded on alternating pins. 

Plastic-packaged devices exposed to moisture at 
1 21 °C using a pressure of one atmosphere above 
normal pressure. The pressure forces moisture 
permeation of the package and accelerates corro- 
sion mechanisms (if present) on the device. Exter- 
nal package contaminants can also be activated 
and caused to generate inter-pin current leakage 
paths. 

Device exposed to severe temperature extremes in 
an alternating fashion (-65 °C for 15 minutes and 
150 °C for 15 minutes per cycle) for at least 1000 
cycles. Package strength, bond quality, and consis- 
tency of assembly process are tested in this envi- 
ronment. 

Device exposed to electrostatic discharge pulses. 
Calibration is according to MIL STD 883C, method 
3015.6. Devices are stressed to determine failure 
threshold of the design. 

Test similar to the temperature cycle test, but involv- 
ing a liquid-to-liquid transfer, per MIL-STD-883C, 
Method 1011. 

A nondestructive test to detect loose particles 
inside a device cavity. 



Per MIL-STD-883C, Method 1014 
Per MIL-STD-883C, Method 2002, 
1500 g, 0.5 ms, Condition B 
Per MIL-STD-883C, Method 2020 
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Vibration, variable frequency 

Constant acceleration 

Fine and gross leak 
Electrical test 

Thermal Sequence: 

Fine and gross leak 
Solder heat (optional) 
Temperature cycle 

(10 cycles minimum 
Thermal shock 
(10 cycles minimum) 
Moisture resistance 
Fine and gross leak 
Electrical test 

Thermal/Mechanical Sequence: 

Fine and gross leak 
Temperature cycle 

(10 cycles minimum) 
Constant acceleration 

Fine and gross leak 
Electrical test 
Electrostatic discharge 
Solderability 
Solder heat 

Salt atmosphere 

Lead pull 

Lead integrity 

Electromigration 

Resistance to solvents 



Per MIL-STD-883C, Method 2007, 
20 g, Condition A 
Per MIL-STD-883C, Method 2001 , 
20 kg, Condition D, Y1 Plane min 
Per MIL-STD-883C, Method 1014 
To data sheet limits 



Per MIL-STD- 
Per MIL-STD- 
Per MIL-STD- 
- 65 to + 1 50 
Per MIL-STD- 
-55to+125° 
PerMIL-STD- 
Per MIL-STD- 
To data sheet 



■883.C, Method 1014 
•750C, Method 1014 
■883C, Method 1010, 
°C, Condition C 
883C, Method 1011, 
C, Condition B 
883C, Method 1004 
883C, Method 1014 
limits 



Per MIL-STD-883C, Method 1014 

Per MIL-STD-883C, Method 1010, 

- 65 to +150 °C, Condition C 

Per MIL-STD-883C, Method 2001 , 

30 kg, Y1 Plane 

Per MIL-STD-883C, Method 1014 

To data sheet limits 

Per MIL-STD-883C, Method 3015 

Per MIL-STD-883C, Method 2033 

Per MIL-STD-750C, Method 2031 , 

10 sec 

Per MIL-STD-883C, Method 1009, 

Condition A, 24 hrs min 

Per MIL-STD-883C, Method 2004, 

Condition A 

Per MIL-STD-883C, Method 2004, 

Condition B1 

Accelerated stress testing of conductor 

patterns to ensure acceptable lifetime of 

power-on operation 

Per MIL-STD-883C, Method 2015 
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Table C-1. Microprocessor and Microcontroller Tests 



Test 


Duration 


Sample Size 
Plastic Ceramic 


Operating life, 125 °C, 5.0 V 


1000hrs 


129 129 


Storage life, 150 °C 


1000hrs 


45t 45 


Biased 5 °C/85 percent RH, 5.0 V 


1000hrs 


77 


Autoclave, 121 °C, 1 ATM 


240 hrs 


45 


Temperature cycle, - 65 to 1 50 °C 


1000 eye* 


77 77 


Temperature cycle, to 1 25 °C 


3000 eye 


77 77 


Thermal shock,- 65 to 1 50 °C 


200 eye 


77 77 


Electrostatic discharge, ±2 kV 




15 15 


Latch-up (CMOS devices only) 




5 5 


Mechanical sequence 




22 


Thermal sequence 




22 


Thermal/mechanical sequence 




22 


PIND 




45 


Internal water vapor 




3 


Solderability 




22 22 


Solder heat 




22 22 


Resistance to solvents 




15 15 


Lead integrity 




15 15 


Lead pull 




22 


Lead finish adhesion 




15 15 


Salt atmosphere 




15 15 


Flammability (UL94-V0) 




3 


Thermal impedance 




5 5 



t if junction temperature does not exceed plasticity of package. 
* For severe environments; reduced cycles for office environments. 
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Table C-2 lists the TMS320C3x devices, the approximate number of transis- 
tors, and the equivalent gates. The numbers have been determined from de- 
sign verification runs. 



Table C-2. TMS320C3x Transistors 



Device 


# Transistors 


# Gates 


CMOS: TMS320C30 


600K — 700K 


200K 


CMOS:TMS320C31 


500K — 600K 


100K 



Note: 

Texas Instruments reserves the right to make changes in MOS Semiconduc- 
tor test limits, procedures, or processing without notice. Unless prior ar- 
rangements for notification have been made, Tl advises all customers to re- 
verify current test and manufacturing conditions prior to relying on published 
data. 
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Appendix D 

Calculation of TMS320C30 Power Dissipation 



The TMS320C30 is a state-of-the-art, high-performance, 32-bit floating-point 
DSP microprocessor fabricated in CMOS technology. This device is the first 
member of the third generation of TMS320 family single-chip DSP micropro- 
cessors. Since 1 982 when the first-generation TMS3201 was introduced, the 
TMS320 family has established itself as the industry standard for digital signal 
processing. The TMS320C30's innovative architecture and specialized in- 
struction set provide high speed and increased flexibility for DSP applications. 
This combination makes it possible to execute upto 40 MFLOPS (million float- 
ing point operations per second). 

As device sophistication and levels of integration increase with evolving semi- 
conductor technologies, actual levels of power dissipation vary widely and de- 
pend heavily on the particular application in which the device is used and the 
nature of the program being executed. In addition, due to the inherent charac- 
teristics of CMOS technology, power requirements vary according to clock 
rates and data values being processed. 

This application report presents the information necessary to determine 
TMS320C30 power supply current requirements under different operating 
conditions. With this information, you can determine the power dissipation for 
the device, which, in turn, you can use to calculate thermal management re- 
quirements. 

The major topics discussed in this application report are as follows: 

Qi Fundamental Power Dissipation Characteristics 

■ Components of Power Supply Current Requirements 

■ Dependencies 

■ Test Setup Description 

Ql Current Requirement of Internal Circuitry 

■ Quiescent 

■ Internal Operations 

■ Internal Bus Operations 

Ql Current Requirement of Output Driver Circuitry 
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■ Primary Bus 

■ Expansion Bus 

■ Data Dependency 

■ Capacitive Load Dependence 
Qi Calculation of Total Supply Current 

■ Combining Supply Current Due to All Components 

B Supply Voltage, Operating Frequency, and Temperature Dependen- 
cies 

■ Design Equation 

■ Peak Versus Average Current 

■ Thermal Management Considerations 
Qi Example Supply Current Calculations 
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Fundamental Power Dissipation Characteristics 

D.1 Fundamental Power Dissipation Characteristics 

Typically, an IC's (integrated circuit) power specification is expressed as a 
function of operating frequency, supply voltage, operating temperature, and 
output load. As devices become more complex, the specification must also be 
based on device functionality. CMOS devices inherently draw current only dur- 
ing switching through the linear region. Therefore, the power supply current is 
related to the rate of switching. Furthermore, since the output drivers of the 
TMS320C30 are specified to drive DC loads, the power supply current result- 
ing from external writes depends not only on switching rate but also on the val- 
ue of data written. 

D.1.1 Components of Power Supply Current Requirements 

There are four basic components of the power supply current: 

Q Quiescent 

Qi Internal Operations 

Q Internal Bus Operations 

Q External Bus Operations 



D.1 .2 Dependencies 



The power supply current consumption depends on many factors. Four are 
system related: 

□ Operating frequency 
Qi Supply voltage 

Q Operating temperature 

□ Output load 

and several others are related to TMS320C30 operation: 
Q Duty cycle of operations 

□ Number of buses used 
Qi Wait states 

Q Cache usage 

□i Data value 

The total power supply current for the device is described in an equation apply- 
ing the four basic power supply current components and the dependencies de- 
scribed above. This equation is as follows: 
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' — Uq + 'iops + hbus + 'xbus) x FV X T 

where 

/q is the quiescent current component, 

hops ' s tri e current component due to internal operations, 

hbus ' s the current component due to internal bus usage including data value 
and cycle time dependencies, 

kbus ' s the current component due to external bus usage including data 
value, wait state, cycle time, and capacitive load dependencies, 

FV is a scale factor for frequency and supply voltage, and 

T is a scale factor for operating temperature. 

Application of this equation and determination of all the dependencies are de- 
scribed in detail in this application report. 

While this document explains, in detail, howto determine the power supply cur- 
rent requirement for the TMS320C30, if a less detailed analysis is sufficient, 
the minimum, typical, and maximum values can be used to determine a rough 
estimate of the power supply current requirements. The minimum power sup- 
ply current requirement is 1 1 mA. The typical and average current consump- 
tion is 250 mA as described in the TMS320C30 data sheet and will be asso- 
ciated with most algorithms running on the device unless excessive data out- 
put is being performed. 



The maximum current requirement is 600 mA and occurs only un- 
der WORST CASE conditions: writing alternating data 
(AAAAAAAAh to 55555555h) out of both external buses simulta- 
neously, every cycle, with 80 pF loads and running at 33 MHz. 



If an extremely conservative approach is desired, the maximum value can be 
used. 
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Figure D-1. Current Measurement Test Setup 
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D.1.3 Determining Algorithm Partitioning 



Each part of an algorithm behaves differently, depending on its internal and ex- 
ternal bus usage. To analyze the power supply current requirement, you must 
partition an algorithm into segments with distinct concentrations of internal or 
external bus usage. The analysis that follows is applied to each distinct pro- 
gram segment to determine the power supply current requirement for that sec- 
tion. The average power supply current requirement can then be calculated 
from the requirements of each segment of the algorithm. 



D.1.4 Test Setup Description 



All TMS320C30 supply current measurements were performed on the test set- 
up shown in Figure D-1 . The test setup consists of a TMS320C30, 8K of zero 
wait-state Cypress Semiconductor SRAMs (CY7C186-25PC), and RC loads 
on all data and address lines. A Tektronix Current Probe (P6042) measures the 
power supply current in all Vqd 'ines of the device. The supply voltage on the 
output load is 2.1 5 V. Unless otherwise specified, all measurements are made 
at a supply voltage of 5.0 V, an input clock frequency of 33 MHz, a capacitive 
load of 80 pF, and an operating temperature of 25°C. 
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D.2 Current Requirement of Internal Circuitry 



D.2.1 Quiescent 



The power supply current requirement for internal circuitry consists of three 
components: quiescent, internal operations, and internal bus operations. 
Quiescent and internal operations are constants, but the internal bus opera- 
tions component varies with the rate of internal bus usage and the data values 
being transferred. 



Quiescent refers to the baseline supply current drawn by the TMS320C30 dur- 
ing minimal internal activity, such as executing the IDLE instruction or branch- 
ing to self. It includes the current required to fetch an instruction from on- or 
off-chip memory. The quiescent requirement for the TMS320C30 is 110 mA. 
Examples of quiescent current include: 

□i Maintaining timers and serial ports 

Q Executing the IDLE instruction 

Qi TMS320C30 in HOLD mode pending external bus access 

Q TMS320C30 in reset 

Ql Branching to self 



D.2.2 Internal Operations 



Internal operations are those that require more current than quiescent activity 
but do not include external bus usage or significant internal bus usage. Internal 
operations include register-to-register multiplication, ALU operations, and 
branches. They add a constant 55 mA above the quiescent so that the total 
contribution of quiescent and internal operations is 165 mA. Note, however, 
that internal and/or external bus operations executed via an RPTS instruction 
do not contribute an internal operations power supply current component and 
hence do not add 55 mA to quiescent current. During an instruction in RPTS, 
activity other than the instruction being repeated is suspended; therefore, pow- 
er supply current is related only to the operation performed by the instruction 
being executed. The next contributing factor to the power supply current re- 
quirement is internal bus operations. 
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D.2.3 Internal Bus Operations 



The internal bus operations include all operations that utilize the internal buses 
extensively, such as accessing internal RAM every cycle. No distinction is 
made between internal reads (such as instruction or operand fetches from in- 
ternal ROM or internal RAM banks) and internal writes (such as operand stores 
to internal RAM banks), because internally they are equal. Significant use of 
internal buses adds a term to the power supply current requirement that is data 
dependent. Recall that switching requires more current. Hence, moving 
changing data at high rates requires higher power supply current. 

Pipeline conflicts, use of cache, fetches from external wait-state memory, and 
writes to external wait-state memory all effect the internal and external bus 
cycles of an algorithm executing on the TMS320C30. Therefore, the internal 
bus usage of the algorithm must be determined in order to accurately calculate 
power supply current requirements. The TMS320C30 software simulator and 
XDS emulator both provide benchmarking and timing capabilities that allow 
bus usage to be determined. 

The current resulting from internal bus usage varies roughly exponentially with 
transfer rates. Figure D-2 shows internal bus current requirements for trans- 
ferring alternating data (AAAAAAAAh to 55555555h) at several transfer rates 
(expressed as the transfer cycle time). A transfer rate less than one implies 
multiple accesses per single H1 cycle (that is, using DMA, etc.). Transfer cycle 
times greater than one refer to single-cycle transfers with one or more cycles 
between them. The minimum transfer cycle time is one third, which corre- 
sponds to three accesses in a single H1 cycle. 

The data set AAAAAAAAh to 55555555h exhibits the maximum current for 
these types of operations. Less current is required for transferring other data 
patterns, and current values may be derated accordingly as described later in 
this subsection. 

As the transfer rate decreases (that is, transfer cycle time increases) the incre- 
mental Iqd approaches mA. Transfer rates corresponding to more than 7 H1 
cycles do not add any current and are considered insignificant. This figure rep- 
resents the incremental Iqd due to internal bus operations and is added to 
quiescent and internal operations current values. 

For example, the maximum transfer rate corresponds to three accesses every 
cycle or one-third H1 transfer cycle time. At this rate, 85 mA is added to the 
quiescent (110 mA) and internal operation (55 mA) current values for a total 
of 250 mA. 



D-7 



Current Requirement of Internal Circuitry 



Figure D-2. Internal Bus Current Versus Transfer Rate 
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Figure D-2 shows the internal bus current requirement when transferring As 
followed by 5s for various transfer rates. Figure D-3 shows the data depen- 
dence of the internal bus current requirement when the data is other than As 
followed by 5s. The trapezoidal region bounds all possible data values trans- 
ferred. The lower line represents the scale factor for transferring the same 
data. The upper line represents the scale factor fortransferring alternating data 
(all 0s to all Fs or all As to all 5s, etc.). 

Since the possible permutations of data values is quite large, the extent to 
which data varies is referred to as relative data complexity. This term repre- 
sents a relative measure of the extent to which data values are changing and 
the extent to which the number of bits are changing state. Therefore, relative 
data complexity ranges from 0, signifying minimal variation of data, to a nor- 
malized value of 1 , signifying greatest data variation. 

If a statistical knowledge of the data exists, Figure D-3 can be used to deter- 
mine the exact power supply requirement based on internal bus usage. For ex- 
ample, Figure D-3 indicates a 63% scale factor when all Fs are moved inter- 
nally every cycle with two accesses per cycle. This scale factor is multiplied by 
55 mA (from Figure D-2, at one-half H1 cycle transfer time), yielding 34.65 mA 
because of internal bus usage. Therefore, an algorithm running under these 
conditions requires about 200 mA of power supply current (1 1 + 55 + 34.65). 

Since a statistical knowledge of the data may not be readily available, a nomi- 
nal scale factor may be used. The median between the minimum and maxi- 
mum values at 50% relative data complexity yields a value of 0.80 and can be 
used as an estimate of a nominal scale factor. Therefore, this nominal data 
scale factor of 80% can be used for internal bus data dependency, adding 44 
mA to 1 1 m A (quiescent) and 55 mA (internal operations) to yield 21 mA. As 
an upper bound, assume worst case conditions of three accesses of alternat- 
ing data every cycle, adding 85 mA to 11 mA (quiescent) and 55 mA (internal 
operations) to yield 250 mA. 
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Figure D-3. Internal Bus Current Versus Data Complexity Derating Curve 
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D.3 Current Requirement of Output Driver Circuitry 



The output driver circuits on the TMS320C30 are required to drive significantly 
higher DC and capacitive loads than internal device logic. Therefore, they are 
designed to drive larger currents than internal devices. Because of this, output 
drivers impose higher supply current requirements than other sections of cir- 
cuitry on the device. 

Accordingly, the highest values of supply current are exhibited when external 
writes are being performed at high speed. During reads, or when the external 
buses are not being used, the TMS320C30 is not driving the data bus; this elim- 
inates the most significant component of output buffer current. Furthermore, 
in typical cases, only a few address lines are changing, or the whole address 
bus is static. Under these conditions, an insignificant amount of supply current 
is consumed. Therefore, when no external writes are being performed or when 
writes are performed infrequently, current due to output buffer circuitry can be 
ignored. 

When external writes are being performed, the current required to supply the 
output buffers depends on several different considerations. As with internal 
bus operations, current required for output drivers depends on the data being 
transferred and the rate at which transfers are being made. Additionally, output 
driver current requirements depend on the number of wait states implemented, 
because wait states affect rates at which bus signals switch. Finally, current 
values are also dependent upon external bus DC and capacitive loading. 

External operations involve writes external to the device and constitute the ma- 
jor power supply current component. The power supply current for the external 
buses is made up of three components and is summarized in the following 
equation: 

'base "*" 'prim "*" 'exp 

where 

base = 60-mA baseline current component, 

Iprim ' s ^e primary bus current component, 

l exp is the expansion bus current component. 

The remainder of this section describes in detail the calculation of external bus 
current components. 
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The current due to primary bus writes varies roughly exponentially with both 
wait states and write cycle time. Also, current components due to output driver 
circuitry are represented as offsets from the baseline value. Since the baseline 
value is related to internal current components, negative values for current off- 
set are obtained under some circumstances. Note, however, that actual nega- 
tive current does not occur. 

As mentioned in the previous section, to obtain accurate current values, timing 
of write cycles on the buses must first be established. This is accomplished by 
analyzing program activity, including any pipeline conflicts that may exist, to 
determine the rate and timings at which write cycles to the external buses oc- 
cur. Information from the TMS320C30 emulator or simulator as well as the 
TMS320C3x User's Guide is used to make these determinations. Note that ef- 
fects from the use of cache must also be accounted for in these analyses, be- 
cause use of cache can effect whether or not instructions are fetched from ex- 
ternal memory. 

When evaluating external write activity in a given program segment, a consid- 
eration that must be made is whether or not a particular level of external write 
activity constitutes significant activity. If writes are being performed at a slow 
enough rate, they do not significantly impact supply current requirements; 
therefore, current due to external writes can be ignored. This is the case, how- 
ever, only if writes are being performed at very slow rates on both the primary 
and the expansion buses. If writes are being performed at high speed on only 
one of the two external buses, the approach described in this section should 
still be used for calculation of current requirements. 

Note that although negative incremental current values are obtained under 
some circumstances, the total contribution for external buses, including base- 
line current, must always be positive. This is because, when external buses are 
used minimally, total current requirements always approach the current contri- 
bution due to internal components, which is solely a function of internal activity. 
This places a lower limit on current contributions resulting from the primary and 
expansion buses, because the total current due to external buses is the sum 
of the 60-mA baseline value and the primary and expansion bus components. 
This effect is discussed in further detail in the rest of this subsection. 

When bus-write cycle timing has been established, Figure D-4 can be used 
to determine the contribution to supply current due to this bus activity. 
Figure D-4 shows values of current contribution from the primary bus for vari- 
ous numbers of wait states and H1 cycles between writes. These characteris- 
tics are exhibited when writes of alternating 55555555h and AAAAAAAAh are 
being performed at a capacitive load of 80 pF per output signal line. The condi- 
tions exhibit the highest current values on the device. The values presented 
in the figure represent incremental or additional current contributed by the pri- 
mary bus output driver circuitry under the given conditions. Current values ob- 
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tained from this graph are later scaled and added to several other current terms 
to calculate the total current for the device. As indicated in the figure, the lower 
curve represents the current contribution for 1 8 or more cycles between writes. 

Figure D-4. Primary Bus Current Versus Transfer Rate and Wait States 

Primary Bus Analysis [80 pF, As/5s] 
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Note that "number of cycles between writes" refers to the number of H1 cycles 
between the active portion of the write cycle s as de f ined in t he TMS320 C3x 
User's GuideAhaX is, between H1 cycles when STRB, MSTRB, or IOSTRB and 
R/W (or XR/W as the case may be) are low. As shown in Figure D-4, the mini- 
mum number of cycles between writes is one because with back-to-back writes 
there is one H1 cycle between active portions of the writes. 

To further illustrate the relationship of current and write cycle time, Figure D-5 
shows the characteristics of current for various numbers of cycles between 
writes for zero wait states. The information on this curve can be used to obtain 
more precise values of current if zero wait states are being used and the num- 
ber of cycles between writes does not fall on one of the curves in Figure D-4. 



D-12 



Calculation of TMS320C30 Power Dissipation 



Current Requirement of Output Driver Circuitry 

Figure D-5. Primary Bus Current Versus Transfer Rate at Zero Wait States 

Primary Bus Duty Cycle Analysis [80 pF, As/5s] 
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Figure D-6. Expansion Bus Current Versus Transfer Rate and Wait States 

Expansion Bus Analysis [80 pF, As/5s] 
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Note that although these graphs contain negative current values, this does not 
indicate that negative current actually occurs. The negative values exist be- 
cause the graphs represent a current offset from a common baseline current 
value, which is not necessarily the lowest current exhibited. Using this ap- 
proach to depict current contributions due to different components simplifies 
current calculations because it allows calculations to be made independently. 
Independent calculations can be made because information about relation- 
ships between different sections of the device are included implicitly in the in- 
formation for each section. 
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Figure D-4and Figure D-5 show that the contribution of writes for external bus 
activities becomes insignificant if writes are being performed with more than 
1 8 cycles between each write. Under these conditions, the incremental value 
of-30-mA current contribution due to the primary bus should be used. Note, 
however, that a value of -30 mA should be used only if the expansion bus is 
being used extensively, because the total contribution for external buses in- 
cluding baseline current must always be positive. If the expansion bus is not 
being used and the primary bus is being used minimally, the current contribu- 
tion due to the primary bus must always be greater than or equal to 20 mA. This 
ensures that the correct total current value is obtained when summing external 
bus components. Once a current value has been obtained from Figure D-4 or 
Figure D-5, this value may be scaled by a data dependency factor if neces- 
sary, as described at the end of this section. This scaled value is then summed 
along with several other current terms to determine the total supply current. 
Calculation of total supply current is described in detail in the next section. 



D.3.2 Expansion Bus 



Currents due to the primary and expansion buses are similar in characteristics 
but differ slightly because of several factors, including the fact that the expan- 
sion bus has 1 1 fewer address outputs than the primary bus (1 3 rather than 24). 
This difference is exhibited in a slightly lower overall current contribution from 
the expansion bus than from the primary bus. 

Accordingly, determining expansion bus current follows the same basic prem- 
ises as determining the the primary bus current. Figure D-6 and Figure D-7 
show the same current relationships for the expansion bus as Figure D-4 and 
Figure D-5 show for the primary bus. Also, since the total external buses' cur- 
rent contribution must be positive, if the primary bus is not being used and the 
expansion bus is being used minimally, then the minimum current contribution 
due to the expansion bus is -30 mA. Finally, as with the primary bus, current 
values obtained from these figures may require scaling by a data dependency 
factor as described in the next subsection. 
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Figure D-7. Expansion Bus Current Versus Transfer Rate at Zero Wait States 

Expansion Bus Duty Cycle Analysis [80 pF, As/5s] 
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D.3.3 Data Dependency 



Data dependency of current for the primary and expansion buses is expressed 
as a scale factor that is a percentage of the maximum current exhibited by ei- 
ther of the two buses. Datadependenciesforthe primary and expansion buses 
are shown in Figure D-8 and Figure D-9, respectively. 

These two figures show normalized weighting factors that can be used to scale 
current requirements on the basis of patterns in data being written on the exter- 
nal buses. The range of possible weighting factors forms a trapezoidal pattern 
bounded by extremes of data values. As can be seen from Figure D-8 and 
Figure D-9, the minimum current is exhibited by writing all zeros, while the 
maximum current occurs when writing alternating 55555555h and 
AAAAAAAAh. This condition results in a weighting factor of one, which corre- 
sponds to using the values from Figure D-4 and/or Figure D-5 directly. 

As with internal bus operations, data dependencies for the external buses are 
well defined, but accurate prediction of data patterns is often either impossible 
or impractical. Therefore, unless precise knowledge of data patterns exists, an 
estimate of a median or average value for scale factor should be used. If it is 
assumed that data will be neither 5s and As nor all 0s and will be varying ran- 
domly, then a value of 0.85 is appropriate. Otherwise, if a conservative ap- 
proach is preferred, a value of 1 .0 can be used as an upper bound. 
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Regardless of the approach taken for scaling, once scale factors for primary 
and expansion buses have been determined, apply these factors to scale cur- 
rent values determined by using the graphs in the previous two subsections. 
For example, if a nominal scale factor of 0.85 is used and the system uses 
wait states with 2 cycles between accesses on both the primary and expansion 
buses, the current contribution from the two buses is as follows: 

Primary: 0.85 x 80 mA = 68 mA 

Expansion: 0.85 x 40 mA = 34 mA 



Figure D-8. Primary Bus Current Versus Data Complexity Derating Curve 
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Figure D-9. Expansion Bus Current Versus Data Complexity Derating Curve 
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D.3.4 Capacitive Load Dependence 



Once cycle timing and data dependencies have been accounted for, capacitive 
loading effects should be included in a similar manner to that of data dependen- 
cy. Figure D-1 shows the scale factor to be applied to the current values ob- 
tained above as a function of actual load capacitance if the load capacitance 
presented to the buses is less than 80 pF. 

In the previous example, if the load capacitance is 20 pF instead of 80 pF, a 
scale factor of 0.84 is used, yielding: 

Primary: 0.84 x 68 mA = 57.12 mA 

Expansion: 0.84 x 34 mA = 28.56 mA 

The slope of the load capacitance line in Figure D-1 is 0.26% normalized Iqd 
per pF. While this slope may be used to interpolate scale factors for loads great- 
er than 80 pF, the TMS320C30 is specified to drive output loads less than 80 
pF, and interface timings cannot be guaranteed at higher loads. With data de- 
pendency and capacitive load scale factors applied to the current values for 
primary and expansion buses, the total supply current required for the device 
for a particular application can be calculated, as described in the next section. 
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Figure D-10. Current Versus Output Load Capacitance 



D 
_Q 

TJ 

"(5 

E 



1 

0.95 

0.9 

0.85 

0.8 
75 




•dd 


Versus Output Load Capacitance 





















10 20 30 40 50 60 70 

Output Load Capacitance (pF) 



80 



D-18 



Calculation of TMS320C30 Power Dissipation 



Calculation of Total Supply Current 



D.4 Calculation of Total Supply Current 



The previous sections have discussed currents contributed by several different 
sources on the TMS320C30. Because determinations of actual current values 
are unique and independent for each source, each current source was dis- 
cussed separately. In an actual application, however, the sum of the indepen- 
dent contributions from each current determines the total current requirement 
for the device. This total current value is exhibited as the total current supplied 
to the device through all of the Vqq inputs and returned through the Vss con- 
nections. 

Note that numerous Vpp and Vss P' ns on the device are routed to a variety 
of internal connections, not all of which are common. Externally, however, all 
of these pins should be connected in parallel to 5 V and ground planes, respec- 
tively, with as low impedance as possible. 

As mentioned previously, because different program segments inherently per- 
form different operations that are often quite distinct from each other, it is typi- 
cally appropriate to consider current for each of the different segments inde- 
pendently. Once this is done, then peak current requirements are readily ob- 
tained. Further, average current calculations can also be made that can be 
used to determine heating effects of power dissipation. These effects, in turn, 
can be used to determine thermal management considerations. 



D.4.1 Combining Supply Current Due to All Components 



To determine the total supply current requirements for any given program activ- 
ity, calculate each of the appropriate components and combine them in the fol- 
lowing sequence: 

1 ) Start with 1 1 0-mA quiescent current requirement. 

2) Add 55 mA for internal operations unless the device is dormant, such as 
when executing IDLE, NOPs, or branches-to-self, or performing internal 
and/or external bus operations using an RPTS instruction (see Internal 
Operations section). Internal or external bus operations executed via 
RPTS do not contribute an internal operations power supply current com- 
ponent and hence do not add 55 mA to quiescent current. Therefore, cur- 
rent components in the next two steps may still be required even though 
the 55 mA is omitted. 

3) If significant internal bus operations are being performed (see Internal Bus 
Operations section), add the calculated current value. 

4) If external writes are being performed at high speed (see Current Require- 
ments Due to Output Driver Circuitry section), add 60 mA and then add the 
values calculated for primary and expansion bus current components. If 
only one external bus is being used, the appropriate incremental current 
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for the unused bus should still be included because the current offsets in- 
clude components required for operating both buses. Note, however, that, 
as discussed previously, the total current contribution for external buses, 
including baseline, must always be positive. 

The current value resulting from summing these components is the total device 
current requirement for a given program activity. 

D.4.2 Supply Voltage, Operating Frequency, and Temperature Dependencies 

Besides current dependencies that are specific to each of the components of 
supply current discussed earlier, supply voltage level, operating temperature 
and operating frequency also affect current requirements. However, these 
considerations affect the total supply current, not just specific components 
(that is, internal or external bus operations). 1 Note that supply voltages, operat- 
ing temperature, and operating frequency must be maintained within required 
device specifications. 

In the same manner as datadependencies discussed in other sections, consid- 
erations for these dependencies are applied as a scale factor. This factor is 
applied, once the total current for a particular program segment has been de- 
termined. Figure D— 11 shows the relative scale factors to be applied to the sup- 
ply current values as a function of both Vprj and operating frequency. 

Power supply current consumption does not vary significantly with operating 
temperature. However, if desired, a scale factor of 2% normalized lDDP er 50°C 
change in operating temperature may be used to derate current within the spe- 
cified range noted in the TMS320C30 data sheet. This temperature depen- 
dence is shown graphically in Figure D-1 2. Note that a temperature scale fac- 
tor of 1 .0 corresponds to current values at 25°C, which is the temperature at 
which all other references in the document are made. 
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Figure D-11. Current Versus Frequency and Supply Voltage 
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Figure D-12. Current Versus Operating Temperature Change 
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D.4.3 Design Equation 



The procedure for determining the power supply current requirement can be 
summarized in the following equation: 

I = Uq + hops + libus + Ixbus) x FV x T 

where 

lq =110 m A, 

'iops = ^ mA, 

kbus = Di x f-j (see Table D-1), 

'xbus = vase + 'prim + 'exp> 
with 

base =60mA, 

Iprim = D 2 x Q? x ^2 ( see Table D-1), 

l exp = D3 x C3 x f3 (see Table D-1). 

FV\s the scale factor for frequency and supply voltage, and 

Tis the scale factor for operating temperature. 

Table D-1 describes the symbols used in the power supply current equation. 
Furthermore, the table displays the figure number from which the value can be 
obtained. 
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Table D-1. Current Equation Symbols 



Symbol 


Description 


Graph/Value 


k 


Quiescent Current 


110mA 


liops 


Internal Operations Current 


55 mA 


: fousy 


Internal Bus Operations Current 




D 1 


Internal Bus Data Scale Factor 


Figure D-3 


f1 


Internal Bus Current Requirement 


Figure D-2 


Ixbus 


External Bus Operations Current 




base 


External Bus Base Current 


60 mA 


prim 


Primary Bus Operations Current 




D 2 


Primary Bus Data Scale Factor 


Figure D-8 


C 2 


Primary Bus Cap Load Scale Factor 


Figure D-1 


f2 


Primary Bus Current Requirement 


Figure D-4 or D-5 


kxp 


Expansion Bus Operations Current 




D 3 


Expansion Bus Data Scale Factor 


Figure D-9 


c 3 


Expansion Bus Cap Load Scale Factor 


Figure D-1 


f3 


Expansion Bus Current Requirement 


Figure D-6 or D-7 


FV 


Freq/Supply Voltage Sea! * "actor 


Figure D-1 1 


: -V^. ■■'■• 


Temperature Scale Factor 


Figure D-1 2 



D.4.4 Peak Versus Average Current 



If current is observed over the course of an entire program, typically some seg- 
ments will exhibit significantly different levels of current required for different 
durations of time. For example, a program may spend 80% of its time perform- 
ing internal operations, drawing a current of 250 mA, and spend the remaining 
20% of its time performing writes at full speed to the expansion bus, drawing 
300 mA. 

While knowledge of peak current levels is important in order to establish power 
supply requirements, some applications require information about average 
current. This is particularly significant if periods of high peak current are short 
in duration. Average current can be obtained by performing a weighted sum 
of the currents due to the various independent program segments over time. 
In the example above, the average current can be calculated as follows: 

/ = 0.8 x 250 mA + 0.2 x 300 mA = 260 mA 

Using this approach, average current for any number of program segments 
can be calculated. 
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D.4.5 Thermal Management Considerations 



Heating characteristics of the TMS320C30 are dependent upon power dissipa- 
tion, which is, in turn, dependent upon power supply current. When making 
thermal management calculations, several considerations must be made that 
relate to the manner in which power supply current contributes to power dissi- 
pation and to the TMS320C30 package thermal characteristics' time constant. 

Depending on sources and destinations of current on the device, some current 
contributions to Iqd do not constitute a component of power dissipation at 5 
volts. Accordingly, if the total current flowing into Vqd is used to calculate pow- 
er dissipation at 5 volts, erroneously large values for power dissipation will be 
obtained. Power dissipation is defined as: 

P = / x V 

(where P is power, I is current, and V is voltage). If device outputs are driving 
any DC load to a logic high level, only a minor contribution is made to power 
dissipation because CMOS outputs typically drive to a level within a few tenths 
of a volt of the power supply rails. If this is the case, subtract these current com- 
ponents out of the total supply current value; then calculate their contribution 
to power dissipation separately, and add it to the total power dissipation (see 
Figure D-13). If this is not done, these currents resulting from driving a logic 
high level into a DC load will cause unrealistically high power dissipation val- 
ues. The error occurs because the currents resulting from driving a logic high 
level into a DC load will appear as a portion of the current used to calculate 
power dissipation due to Vqd at 5 volts. 



Figure D-13. Load Currents 
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Furthermore, external loads draw supply only current when outputs are being 
driven high, because when outputs are in the logic zero state, the device is 
sinking current, which is supplied from an external source. Therefore, the pow- 
er dissipation due to this current component will not have a contribution 
through Iqd Du t will contribute to power dissipation with a magnitude of: 

P = Vo L x Iol 

where Vol ' s the low level output voltage and Iol " s the current being sunk by 
the output as shown in Figure D-1 3. The power dissipation component due to 
outputs being driven low should be calculated and added to the total power dis- 
sipation. 

When outputs with DC loads are being switched, the power dissipation compo- 
nents from outputs being driven high and outputs being driven low are aver- 
aged and added to the total device power dissipation. Calculation of power 
components due to DC loading of the outputs should be made separately for 
each program segment before average power is calculated. 

Note that any unused inputs that are left disconnected may float to a voltage 
level that will cause input buffer circuits to remain in the linear region and there- 
fore contribute a significant component to power supply current. Accordingly, 
any unused inputs should be made inactive by being either grounded or pulled 
high if absolute minimum power dissipation is desired. If several unused inputs 
must be pulled high, they may be pulled high together through one resistor to 
minimize component count and board space. 

When you use power dissipation values to determine thermal management 
considerations, you should use the average power unless the time duration of 
individual program segments is long. The thermal characteristics of the 
TMS320C30 in the 1 81 -pin PGA package are exponential in nature with a time 
constant t = 4.5 minutes. Therefore, when subjected to a change in power, the 
temperature of the device package will reach approximately 63% of the total 
temperature change due to this power after 4.5 minutes. Accordingly, if the 
time duration of program segments exhibiting high power dissipation values 
is short (on the order of a few seconds), average power, calculated in the same 
manner as average current as described in the previous subsection, can be 
used. 

Otherwise, maximum device temperature should be calculated on the basis of 
the actual time duration of the program segments involved. For example, if a 
particular program segment lasts for 7 minutes, then using the exponential 
function, it can be calculated that a device will reach approximately 80% of the 
temperature due to the total power dissipation during the program segment. 

Note that the average power should be determined by calculating the power 
for each program segment (including considerations described above) and 
performing a time average of these values, rather than simply multiplying the 
average current as determined in the previous subsection, by Vrjo- 
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Specific device temperature calculations are made using the TMS320C30 
thermal impedance characteristics included in the TMS320C30 data sheet in 
Chapter 1 3 of the TMS320C3x User's Guide. 
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Example Supply Current Calculations 



D.5 Example Supply Current Calculations 



An FFT represents a typical DSP algorithm. The FFT code in Section D.8 pro- 
cesses data in the RAM blocks and writes the result out to zero wait-state exter- 
nal SRAM on the primary bus. The program executes out of zero wait state ex- 
ternal SRAM on the primary bus, and the TMS320C30's cache is enabled. The 
entire algorithm consists mainly of internal bus operations and hence includes 
quiescent and, in general, internal operations as well. At the end of processing, 
the 1024 results are written out on the primary bus. Therefore, the algorithm 
exhibits a higher current requirement during the write portion, where the exter- 
nal bus is being used significantly. 



D.5.1 Processing 



The processing portion of the algorithm is 95% of the total algorithm. During 
this portion, the power supply current is required only for the internal circuitry. 
The processing of data is done in several loops that compose a majority of the 
algorithm. During these loops, two operands are transferred on every cycle. 
The current required for internal bus usage, then, is 55 mA taken from 
Figure D-2. The data is assumed to be random. A data value scale factor of 
0.8 is used from Figure D-3. This value scales 55 mA, yielding 44 mA for inter- 
nal bus operations. Adding 44 mA to the quiescent current requirement and in- 
ternal operations current requirement yields a current requirement of 209 mA 
for the major portion of the algorithm. 



/ = /„ + I, 



tops 



ibus 



I = 110 mA + 55 mA + (55 mA){0.Q) = 209 mA 



D.5.2 Data Output 



The portion of the algorithm corresponding to writing out data is approximately 
5% of the total algorithm. Again, the data that is being written is assumed to 
be random. From Figure D-3 and Figure D-8, scale factors of 0.80 and 0.85 
are used for derating due to data value dependency for internal and primary 
buses, respectively. During the data dump portion of the code, a load and store 
are performed every cycle; however, the parallel load/store instruction is in an 
RPTS loop, so there is no contribution due to internal operations, because the 
instruction is fetched only once. The only internal contributions are due to 
quiescent and internal bus operations. Figure D-4 indicates a 1 70-mA current 
contribution due to writes every available cycle, and Figure D-6 indicates a 
-80-mA contribution due to not using the expansion bus (that is, greater than 
1 8 H1 cycles between writes). Therefore, the total contribution due to this por- 
tion of the code is: 



I — In + linns + A 



'xbus 
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D.5.3 Average 



or, 

/ = 110 + (55 mA)(0.8) + 60 mA - 80 mA + (170 mA){0.85) 
= 278.5 mA 



The average current is derived from the two portions of the algorithm. The pro- 
cessing portion took 95% of the time and required about 21 mA, and the data 
dump portion took the other 5% and required about 280 mA. The average is 
calculated as: 

l avg = (0.95)(21 mA) + (0.05)(280 mA) = 213.5 mA 

From the thermal characteristics specified in the TMS320C3x User's Guide, 
it can be shown that this current level corresponds to a case temperature of 
43°C. This temperature meets the maximum device specification of 85°C and 
hence requires no forced air cooling. 



D.5.4 Experimental Results 



A photograph of the power supply current for the FFT is shown in the photo- 
graph in Appendix A. During the FFT processing, the current measured varied 
between 1 80 and 220 mA. The peak of the current during external writes was 
270 mA, and the average current requirement, as measured on a digital multi- 
meter was 200 mA. The calculations yielded results that were extremely close 
to the actual measured power supply current. 
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D.6 Summary 



Summary 



The power supply current requirement for the TMS320C30 cannot be ex- 
pressed simply in terms of operating frequency, supply voltage, and output 
load capacitance. A more complete specification, one based on device func- 
tionality, must be used to determine a more accurate power supply current re- 
quirement. This application note presents the necessary information you will 
require to determine power supply specifications. The specification is based 
on a knowledge of the algorithm and its operation on the TMS320C30 in terms 
of internal and external bus usage. As devices become more complex, the 
application of the approach presented in this document becomes vital. 

The power supply current requirement for the TMS320C30 depends on device 
functionality and system parameters. The components of current related to de- 
vice functionality are those due to quiescent current, internal operations, inter- 
nal bus operations, and external bus operations. The dependencies related to 
system parameters are those due to operating frequency, supply voltage, out- 
put load capacitance, and operating temperature. The typical power supply 
current requirement is 250 mA, and the minimum, or quiescent, is 110 mA. 



The maximum current requirement is 600 m A and occurs only un- 
der WORST CASE conditions: writing alternating data 
(AAAAAAAAh to 55555555h) out of both external buses simulta- 
neously, every cycle, with 80 pF loads and running at 33 MHz. 
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D.7 Photo of l DD for FFT 
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D.8 FFT Assembly Code 



FFT Assembly Code 



. GLOBL 


FFT 




. GLOBL 


N 




. GLOBL 


M 




. GLOBL 


SINE 




SINTAB: 




; setup 


.WORD 


SINE 




RAMO: 






. WORD 


809800h 




OUTBUF : 






. WORD 


800h 




.TEXT 






FFT : LDP 


SINTAB 


; proce, 



processing portion: 
; quiescent, internal and 
; bus operations 



LDI 
LSH 



N, IRO 
-1,IR0 



; LENGTH-TWO BUTTERFLIES 



LDI 


@RAM0,AR0 


LDI 


IR0,RC 


SUBI 


1,RC 


RPTB 


BLK1 


ADDF 


*+AR0,*AR0++,R0 


SUBF 


*AR0,*-AR0,R1 


BLK1 STF 


R0,*-AR0 


I | STF 


R1,*AR0++ 



; FIRST PASS OF THE DO-20 LOOP (STAGE K=2 IN DO-10 LOOP) 





LDI 


@RAM0,AR0 




LDI 


2, IRO 




LDI 


N,RC 




LSH 


-2,RC 




SUBI 


1,RC 




RPTB 


BLK2 




ADDF 


*+AR0 ( IRO ) , *AR0++ ( IRO ) , RO 




SUBF 


*AR0,*-AR0(IR0) ,R1 




NEGF 


*+AR0,R0 


1 1 


STF 


R0,*-AR0(IR0) 


BLK2 


STF 


R1,*AR0++(IR0) 


1 1 


STF 


RO, *+AR0 



; MAIN LOOP (FFT STAGES) 



LDI 



N,IR0 
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LSH 


-2, IRO 




LDI 


3,R5 




LDI 


1,R4 




LDI 


2,R3 


LOOP 


LSH 


-1, IRO 




LSH 


1,R4 




LSH 


1,R3 



INNER LOOP (DO-20 LOOP IN THE PROGRAM) 



INLOP: 



LDI 


@RAM0,AR5 


LDI 


IR0,AR0 


ADD I 


@SINTAB,ARO 


LDI 


R4,IR1 


LDI 


AR5 , AR1 


ADD I 


1,AR1 


LDI 


AR1 , AR3 


ADD I 


R3,AR3 


LDI 


AR3 , AR2 


SUBI 


2,AR2 


ADD I 


R3 , AR2 , AR4 


LDF 


*AR5++(IR1) , RO 


ADDF 


*+AR5 (IR1) ,R0,R1 


SUBF 


R0,*++AR5 (IR1) ,R0 


STF 


Rl, *-AR5 (IR1) 


NEGF 


RO 


NEGF 


*++AR5 (IR1) ,R1 


STF 


R0,*AR5 


STF 


R1,*AR5 



INNERMOST LOOP 



BLK3 



LDI 


N,IR1 


LSH 


-2,IR1 


LDI 


R4,RC 


SUBI 


2,RC 


RPTB 


BLK3 


MPYF 


*AR3,*+AR0 (IR1) ,R0 


MPYF 


*AR4,*AR0,R1 


MPYF 


*AR4,*+AR0 (IR1) ,R1 


ADDF 


R0,R1,R2 


MPYF 


*AR3 , *AR0 + + ( IRO ) , RO 


SUBF 


R0,R1,R0 


SUBF 


*AR2,R0,R1 


ADDF 


*AR2,R0,R1 


STF 


R1,*AR3++ 


ADDF 


*AR1,R2,R1 


STF 


R1,*AR4 


SUBF 


R2,*AR1,R1 


STF 


Rl, *AR1++ 


STF 


R1,*AR2 


SUBI 


@RAM0,AR5 
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FFT Assembly Code 



DUMP 



ADD I 


R4,AR5 


CMP I 


N,AR5 


BLTD 


INLOP 


ADD I 


@RAM0,AR5 


NOP 




NOP 




ADD I 


1,R5 


CMP I 


M,R5 


BLE 


LOOP 


LDI 


@RAM0,AR0 


LDI 


@OUTBUF,ARl 


LDF 


*AR0++,R0 


RPTS 


N-2 


LDF 


*AR0++,R0 


STF 


RO, *AR1++ 


STF 


RO, *AR1++ 


LDI 


RAMO , AR1 


LDI 


@RAM0,AR0 


XOR 


400h,AR0 


STI 


AR0,*AR1 



data dump portion 
quiescent, internal bus 



ops and primary bus ops 



swap RAM banks 



,END 



FFT 
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Architectural Overview 



CPU Registers, Memory, and Cache 



Data Formats and Floating-Point Operation 



Addressing 



Program Flow Control 



External Bus Operation 



Peripherals 



Pipeline Operation 



Assembly Language Instructions 



Software Applications 



Hardware Applications 
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SMJ320C30 Digital Signal Processor Data Sheet 



Chapter E 

SMJ320C30 Digital Signal Processor 

Data Sheet 



This appendix contains the standalone data sheet for the SMJ320C30 Digital 
Signal Processor. 
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SMJ320C30 
DIGITAL SIGNAL PROCESSOR 
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Performance 

— SMJ320C30-28 (70-ns Single Cycle 
Instruction Execution Time) 

- 28.6 MFLOPS (Million 
Floating-Point Operations per 
Second) 

- 14.3 MIPS (Million Instructions per 
Second) 

— SMJ320C30-25 (80-ns Single Cycle 
Instruction Execution Time) 

- 25 MFLOPS 

- 12.5 MIPS 

• Class B High-Reliability Processing 

• One 4K x 32-Bit Single-Cycle Dual-Access 
On-Chip ROM Block 

• Two 1K x 32-Bit Single-Cycle Dual-Access 
On-Chip RAM Blocks 

• 64-Word x 32-Bit Instruction Cache 

• 32-Bit Instruction and Data Words, 24-Bit 
Addresses 

• 40/32-Bit Floating Point/Integer Multiplier 
and ALU 

• 32-Bit Barrel Shifter 

• Eight Extended-Precision Registers 
(Accumulators) 

• Two Address Generators With Eight 
Auxiliary Registers and Two Auxiliary 
Register Arithmetic Units 

• On-Chip Direct Memory Access (DMA) 
Controller for Concurrent I/O and CPU 
Operation 

• Integer, Floating Point, and Logical 
Operations 

• Two- and Three-Operand Instructions 

• Parallel ALU and Multiplier Executions in a 
Single Cycle 

• Block Repeat Capability 

• Zero-Overhead Loops with Single-Cycle 
Branches 

• Conditional Calls and Returns 

• Interlocked Instructions for 
Multiprocessing Support 

• Two External Interface Ports 

EPIC is a trademark of Texas Instruments Incorporated. 



GB Package* 
(Top View) 



123456789 1011 12131415 





HU/HTPackaget 
(Top View) 




t Pin assignments/function information are provided 
by the page 2 and 3 table, pin assignments. 

Two Serial Ports with Support for 
8/16/32-Bit Transfers 

Two 32-Bit Timers 

1.0 Micron EPIC™ CMOS Technology 

181-Pin Grid Array Ceramic Package 
(GB Suffix) 

Two 196-Pin Leaded Ceramic Chip Carriers 

— Quad flat pack (HT Suffix) 

— Gull wing carrier (HU Suffix) 

-55°C to 125°C Operating Temperature 
Range 



PRODUCTION DATA document* contain Information 
current n ol publlcition dite. Product! conform to 
specifications per the terms of Texts Instruments 
stindird warranty. Production processing does not 
necessarily Include testing of all parameters. 
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description 



The SMJ320C30's internal bus and special digital processing (DSP) instruction set have the speed and flexibility 
to execute up to 28.6 MFLOPS (million floating-point operations per second). The SMJ320C30 optimizes speed 
by implementing functions in hardware that other processors implement through software or microcode. This 
hardware-intensive approach provides the design engineer with power previously unavailable on a single chip. 

The SMJ320C30 can perform parallel multiply and ALU operations on integer or floating-point data in a single 
cycle. The processor also possesses ageneral-purpose registerfile, program cache, dedicated auxiliary register 
arithmetic units (ARAU), internal dual-access memories, one DMA channel supporting concurrent I/O, and a 
short machine-cycle time. High performance and ease of use are achieved through greater parallelism, greater 
accuracy, and general-purpose features. 

General-purpose applications are greatly enhanced by the large address space, multiprocessor interface, 
internally and externally generated wait states, two external interface ports, two timers, two serial ports, and 
multiple interrupt structure. The SMJ320C30 supports a wide variety of system applications from host processor 
to dedicated coprocessor. 

High-level language is more easily implemented through a register-based architecture, large address space, 
powerful addressing modes, flexible instruction set, and well-supported floating-point arithmetic. 
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pin functional description 

GB package pin function assignments 



PIN 


FUNCTION 


PIN 


FUNCTION 


PIN 


FUNCTION 


PIN 


FUNCTION 


PIN 


FUNCTION 


F15 


AO 


C4 


DO 


P3 


FSRO 


A13 


XAO 


R4 


XDO 


G12 


A1 


D5 


D1 


R2 


FSXO 


A14 


XA1 


P5 


XD1 


G13 


A2 


A2 


D2 


N4 


CLKRO 


D11 


XA2 


N6 


XD2 


G14 


A3 


A3 


D3 


M5 


CLKXO 


C12 


XA3 


R5 


XD3 


G15 


A4 


B4 


D4 


R1 


DRO 


B13 


XA4 


P6 


XD4 


H15 


A5 


C5 


D5 


R3 


DXO 


A15 


XA5 


M7 


XD5 • 


H14 


A6 


D6 


D6 


M3 


FSR1 


B15 


XA6 


R6 


XD6 


J15 


A7 


A4 


Df 


P1 


FSX1 


C14 


XA7 


N7 


XD7 


J14 


A8 


B5 


D8 


L4 


CLKR1 


E12 


XA8 


P7 


XD8 


J13 


A9 


C6 


D9 


N2 


CLKX1 


D13 


XA9 


R7 


XD9 


K15 


A10 


A5 


D10 


N1 


DR1 


C15 


XA10 


P8 


XD10 


J12 


A11 


B6 


D11 


P2 


DX1 


D14 


XA11 


R8 


XD11 


K14 


A12 


D7 


D12 






E13 


XA12 


R9 


XD12 


L15 


A13 


A6 


D13 


F14 


EMUO 


J3 


RSVO 


P9 


XD13 


K13 


A14 


C7 


D14 


E15 


EMU1 


J4 


RSV1 


N9 


XD14 


L14 


A15 


B7 


D15 


F13 


EMU2 


K1 


RSV2 


R10 


XD15 


M15 


A16 


A7 


D16 


E14 


EMU3 


K2 


RSV3 


M9 


XD16 


K12 


A17 


A8 


D17 


F12 


EMU4 


L1 


RSV4 


P10 


XD17 


L13 


A18 


B8 


D18 


C1 


EMU5 


K3 


RSV5 


R11 


XD18 


M14 


A19 


A9 


D19 


M6 


EMU6 


L2 


RSV6 


N10 


XD19 


N15 


A20 


B9 


D20 


B3 


H1 


K4 


RSV7 


P,11 


XD20 


M13 


A21 


C9 


D21 


A1 


H3 


M1 


RSV8 


R12 


XD21 


L12 


A22 


A10 


D22 






L3 


RSV9 


M10 


XD22 


N14 


A23 


D9 


D23 


C2 


X1 


M2 


RSV10 ■ 


N11 


XD23 


E5 


LOCATOR 


B10 


D24 


B1 


X2/CLKIN 


D12 


ADV DD 


P12 


XD24 


G1 


IACK 


A11 


D25 


P4 


TCLKO 


H11 


ADV DD 


R13 


XD25 


H2 


INTO 


C10 


D26 


N5 


TCLK1 


D4 


DDV DD 


R14 


XD26 


H1 


INT1 


B11 


D27 






E8 


DDV DD 


M11 


XD27 


J1 


INT2 


A12 


D28 


G2 


XFO 


L8 


IODV DD 


N12 


XD28 


J2 


INT3 


D10 


D29 


G3 


XF1 


M12 


IODV DD 


P13 


XD29 


D15 


MC/MP 


C11 


D30 


D3 


Vbbp 


H5 


MDV DD 


R15 


XD30 


E3 


MSTRB 


B12 


D31 


E4 


VSUBS 


M4 


PDV DD 


P15 


XD31 


E1 
F1 


RDY 
RESET 


F3 




H4 
D8 


V D D 

v D d 


B2 
P14 


cv ss 
cv ss 


C3 
C13 


DV SS 
DV SS 


HOLD 


G4 


R/W 


E2 


HOLDA 


M8 


v D d 


C8 


vss 


N3 


DV SS 


F2 


STRB 


D2 


XRDY 


H12 


Vdd 


H3 


vss 


N13 


DV S S 


F4 


IOSTRB 


D1 


XR1W 


N8 


vss 


H13 


v S s 


B14 


iv S s 



NOTES: 1. ADVdd. DDVdq, IODVqd. MDVqd. and PDVqq pins are on a common plane internal to the device. 

2 - Vqd pins are on a common plane internal to the device. 

3. Vss. CVss. anc ^ 'VSS P' ns are on a common plane internal to the device. 

4. DVss pins are on a common plane internal to the device. 
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HU/HT package pin function assignments 



NOTES: 1. ADVqd. DDVqd. IODN/qd. MDVqd. and PDVqd P in s are on a common piano internal to the device. 

2. Vqo P ,ns are on a common plane internal to the device. 

3. Vss. CVss. and IVss P^s are on a common plane internal to the device. 

4. DVgg pins are on a common plane internal to the device. 



PIN 


FUNCTION 


PIN 


FUNCTION 


PIN 


FUNCTION 


PIN 


FUNCTION 


PIN 


FUNCTION 


1 


PDV DD 


41 


XD23 


81 


A1 


121 


D19 


161 


HOLD 


2 


PDV DD 


42 


XD24 


82 


AO 


122 


D18 


162 


MDV DD 


3 


DRO 


43 


XD25 


83 


EMUO 


123 


VDD 


163 


MDV DD 


4 


FSRO 


44 


XD26 


84 


EMU1 


124 


V D D 


164 


RDY 


5 


CLKRO 


45 


XD27 


85 


EMU2 


125 


vss 


165 


STRB 


6 


CLKXO 


46 


XD28 


86 


EMU3 


126 


vss 


166 


RAV 


7 


FSXO 


47 


XD29 


87 


EMU4 


127 


D17 


167 


RESET 


8 


DXO 


48 


XD30 


88 


MC/MP 


128 


D16 


168 


XF1 


9 


TCLKO 


49 


IODV D D 


89 


XA12 


129 


D15 


169 


XFO 


10 


TCLK1 


50 


DV SS 


90 


XA11 


130 


D14 


170 


IACK 


11 


EMU6 


51 


cv ss 


91 


XA10 


131 


D13 


171 


INTO 


12 


XDO 


52 


cv ss 


92 


XA9 


132 


D12 


172 


vdd • 


13 


XD1 


53 


XD31 


93 


XA8 


133 


D11 


173 


vdd 


14 


XD2 


54 


A23 


94 


XA7 


134 


D10 


174 


vss 


15 


IODV DD 


55 


A22 


95 


XA6 


135 


D9 


175 


v S s 


16 


IODV DD 


56 


A21 


96 


"Vss 


136 


D8 


176 


INT1 


17 


XD3 


57 


A20 


97 


•vss 


137 


D7 


177 


INT2 


18 


XD4 


58 


A19 


98 


DV SS 


138 


D6 


178 


INT3 


19 


XD5 


59 


A18 


99 


VSUBS 


139 


D5 


179 


RSVO 


20 


XD6 


60 


A17 


100 


ADV DD 


140 


D4 


180 


RSV1 


21 


XD7 


61 


A16 


101 


adv dd 


• 141 


D3 


181 


RSV2 


22 


XD8 


62 


A15 


102 


XA5 


142 


02 


182 


RSV3 


23 


XD9 


63 


A14 


103 


XA4 


143 


D1 


183 


RSV4 


24 


XD10 


64 


ADV DD 


104 


XA3 


144 


DO 


184 


RSV5 


25 


V D D 


65 


A13 


105 


XA2 


145 


H1 


185 


RSV6 


26 


Vdd 


66 


A12 


106 


XA1 


146 


H3 


186 


RSV7 


27 


vss 


67 


A11 


107 


XAO 


147 


DDV DD 


187 


RSV8 


28 


v S s 


68 


A10 


108 


D31 


148 


DV S S 


188 


RSV9 


29 


XD11 


69 


A9 


109 


D30 


149 


cv ss 


189 


RSV10 


30 


XD12 


70 


A8 


110 


D29 


150 


cv ss 


190 


DR1 


31 


XD13 


71 


A7 


111 


D28 


151 


X2/CLKIN 


191 


FSR1 


32 


XD14 


72 


A6 


112 


D27 


152 


X1 


192 


CLKR1 


33 


XD15 


73 


vdd 


113 


D26 


153 


VSUBS 


193 


CLKX1 


34 


XD16 


74 


vdd 


114 


DDV DD 


154 


VBBP 


194 


FSX1 


35 


XD17 


75 


vss 


115 


D25 


155 


EMU5 


195 


DX1 


36 


XD18 


76 


vss 


116 


D24 


156 


XRDY, 


196 


DV S S 


37 


XD19 


77 


A5 


117 


D23 


157 


MSTRB 






38 


XD20 


78 


A4 


118 


D22 


158 


IOSTRB 






39 


XD21 


79 


A3 


119 


D21 


159 


XR/W 






40 


XD22 


80 


A2 


120 


D20 


160 


HOLDA 
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signal descriptions 

This section gives signal descriptions for the SMJ320C30 device in the microprocessor mode. The following 
tables list each signal, the number of pins, function, and operating mode(s), i.e., input, output, or high-impedance 
state as indica ted by I, O, or Z. All pins labelled NC are not to be connected by the user. A line over a signal name 
(e.g., RESET) indicates that the signal is active low (true at a logic level). The signals are grouped according 
to function. 

signal descriptions 



PIN 


i/o/zt 


DESCRIPTION 


NAME 


NOs. 


PRIMARY BUS INTERFACE 


D31-D0 


32 


i/o/z 


32-bit data port of the primary bus interface. 


A23-A0 


24 


o/z 


24-bit address port of the primary bus interface. 


R/W 


1 


o/z 


Read/write signal for primary bus interface. This pin is high when a read is performed and low 
when a write is performed over the parallel interface. 


STRB 


1 


o/z 


External access strobe for the primary bus interface. 


RDY 


1 


I 


Ready signal. This pin indicates that the external device is prepared for a primary bus inter- 
face transaction to complete. As long as RDY is a logic high, the data and address buses of 
the primary bus interface remain valid. 




1 


I 


Hold signal for primary bus interface. When HOLD is a logic low, any ongoing transaction is 


completed. The A23-A0, D31-D0, STRB, and R/W signals are placed in a high-impedance 
state, and all transactions over the primary bus interface are held until HOLD becomes a logic 
high. 


HOLD 




1 





Hold acknowledge signal for primary bus interface. This signal is generated in response to a 
logic low on HOLD. It signals that A23-A0, D31-D0, STRB, and R/W are placed in a high-im- 
pedance state and that all transactions over the bus will be held. HOLDA will be high in re- 


HOLDA 


sponse to a logic high of HOLD. 


EXPANSION BUS INTERFACE 


XD31-XD0 


32 


l/O/Z 


32-bit data port of the expansion bus interface. 


XA12-XA0 


13 


O/Z 


1 3-bit address port of the expansion bus interface. 


XR/W 


1 


O/Z 


Read/write signal for expansion bus interface. When a read is performed, this pin is held high; 
when a write is performed, this pin is low. 


MSTRB 


1 





External memory access strobe for the expansion bus interface. 




1 





External I/O access strobe for the expansion bus interface. 


IOSTRB 




1 


I 


Ready signal. This pin indicates that the external device is prepared for an expansion bus 


XRDY 


interface transaction to complete. As long as XRDY is high, the data and address buses of 
the expansion bus interface remain valid. 


CONTROL SIGNALS 




1 


I 


Reset. When this pin is a logic low, the device is placed in the reset condition. When RESET 
becomes a logic high, execution begins from the location specified by the reset vector. 


RESET 


TNT3-TNT6 


4 


I 


External interrupts. 


IACK 


1 





Interrupt acknowledge signal. IACK is set to 1 by the IACK instruction. This can be used to indicate 
the beginning or end of an interrupt service routine. 


MC/MP 


1 


I 


Microcomputer/microprocessor mode pin. 


XF1.XF0 


2 


I/O 


External flag pins. They are used as general-purpose I/O pins or to support interlocked processor 
instructions. 



t Input, output, high-impedance state. 
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signal descriptions (continued) 


PIN 


i/o/zt 


DESCRIPTION 


NAME 


NOs. 


SERIAL PORTO SIGNALS 


CLKXO 


1 


I/O 


Serial port transmit clock. This pin serves as the serial shift clock for the serial port transmitter. 


DXO 


1 


O/Z 


Data transmit output. Serial port transmits serial data on this pin. 


FSXO 


1 


I/O 


Frame synchronization pulse for transmit. The FSXO pulse initiates the transmit data process 
over pin DXO. 


CLKRO 


1 


I/O 


Serial port receive clock. This pin serves as the serial shift clock for the serial port receiver. 


DRO 


1 


I 


Data receive. Serial port receives serial data via the DRO pin. 


FSRO 


1 


I 


Frame synchronization pulse for receive. The FSRO pulse initiates the receive data process over 
DRO. 


SERIAL PORT 1 SIGNALS 


CLKX1 


1 


I/O 


Serial port 1 transmit clock. This pin serves as the serial shift clock for the serial port 1 transmitter. 


DX1 


1 


o/z 


Data transmit output. Serial port 1 transmits serial data on this pin. 


FSX1 . 


1 


I/O 


Frame synchronization pulse for transmit. The FSX1 pulse initiates the transmit data process 
over pin DX1. 


CLKR1 


1 


I/O 


Serial port 1 receive clock. This pin serves as the serial shift clock for the serial port 1 receiver. 


DR1 


1 


I 


Data receive. Serial port 1 receives serial data via the DR1 pin. 


FSR1 


1 


I 


Frame synchronization pulse for receive. The FSR1 pulse initiates the receive data process 
over DR1. 


TIMER SIGNALS 


TCLKO 


1 


I/O 


Timer clock. As an input, TCLKO is used by timer to count external pulses. As an output pin, 
TCLKO outputs pulses generated by timer 0. 


TIMER 1 SIGNALS 


TCLK1 


1 


I/O 


Timer clock. As an input, TCLK1 is used by timer 1 to count external pulses. As an output pin, 
TCLK1 outputs pulses generated by timer 1 . 


SUPPLY AND OSCILLATOR SIGNALS (see Note 5) 


V D D 


4/8 


I 


+5 V supply pin. 


IODV DD 


2/3 


I 


+5 V supply pin. 


ADV DD 


2/3 


I 


+5 V supply pin. 


PDV DD 


1/2 


I 


+5 V supply pin. 


DDV DD 


2/2 


I 


+5 V supply pin. 


MDV DD 


1/2 


I 


+5 V supply pin. 


v S s 


4/8 


I 


Ground pin. 


DV SS 


4/4 


I 


Ground pin. 


cv S s 


2/4 


I 


Ground pin. 


ivss 


1/2 


I 


Ground pin. 


V BBP 


1/1 


NO 


Vrb pump oscillator output. 


VSUBS 


1/2 


I 


Substrate pin. Tie to ground. 


X1 


1 





Output pin from the internal oscillator for the crystal. If a crystal is not used, this pin should be 
left unconnected. 


X2/CLKIN 


1 


I 


Input pin to the internal oscillator from the crystal or a clock. 


H1 


1 





External H1 clock. This clock has a period equal to twice CLKIN. 


H3 


1 





External' H3 clock. This clock has a period equal to twice CLKIN. 



t Input, output, high-impedance state. 
NOTE 5: GB/HU power pins. 
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signal descriptions (concluded) 



PIN 


i/o/zt 


DESCRIPTION 


NAME 


NOs. 


RESERVED 


EMU0-EMU2 


3 


I 


Reserved. Use pullups to +5 volts. 


EMU3 


1 





Reserved. 


EMU4 


1 


I 


Reserved. Use pullups to +5 volts. 


EMU5, EMU6 


2 


NC 


Reserved. 


RSVO-RSV10 


11 


I 


Reserved. Use pullups to +5 volts. 



t Input, output, high-impedance state. 



CAUTION 

Follow the connections specified for the reserved pins. All pullup resistors must be 20 kQ. All +5 volt supply pins 
must be connected to a common supply plane, and all ground pins must be connected to a common ground plane. 



ELECTRICAL SPECIFICATIONS 

absolute maximum ratings over specified temperature range (unless otherwise noted) * 

Supply voltage range, Vcc - 0.3 V to 7 V 

Input voltage range - 0.3 V to 7 V 

Output voltage range - 0.3 V to 7 V 

Continuous power dissipation (see Note 11) 3.15 W 

Minimum free air operating temperature - 55°C 

Maximum operating case temperature 1 25°C 

Storage temperature range - 65°C to 1 50°C 

* Stresses beyond those listed under "Absolute Maximum Ratings" may cause permanent damage to the device. This is a stress rating only; 
functional operation of the device at these or any other conditions beyond those indicated in the "Recommended Operating Conditions" section 
of this specification is not implied. Exposure to absolute-maximum-rated conditions for extended periods may affect device reliability. 
NOTES: 6. All voltage values are with respect to Vgs- 

7. Actual operating power will be less. This value was obtained under specially produced worst-caso tost conditions, which nro not 
sustained during normal device operation. These conditions consist of continuous parallel writes of a checkerboard pattern to both 
primary and extension buses at the maximum rate possible. See normal (lrjc) current specification in the "electrical characteristics" 
table and also read Calculation of TMS320C30 Power Dissipation Application Report. 
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recommended operating conditions (see Note 8) 




MIN 


NOM 


MAX 


UNIT 


Vqd Supply voltages 


SMJ320C30-28 


4.75 


5 , 


5.25 


V 


SMJ320C30-25 


4.5 


5 


5.5 


Vss Supply voltages (CVSS, etc.) 





V 


V|(-| High-level input voltage 


2.1 




V D D + 0.3§ 


V 


Vjh CLKIN high-level input voltage for CLKIN 


3 




V DD +0.3§ 


V 


V||_ Low-level input voltage 


-0.3§ 




0.8 


V 


'OH High-level output current 


-300 


HA 


Iql Low-level output current 


2 


mA 


Jq Operating case temperature 


125 


•c 


T/\ Operating free-air temperature 


-55 


•c 



§ These values derived from characterization but not tested. 
NOTE 8: All input and output voltages are TTL compatible. 

electrical characteristics over specified temperature range 



PARAMETER 


MIN NOM MAX 


UNIT 


Voh High-level output voltage ( Vqq = Min, IrjH = Max) 


2.4 3 


V 


Vql Low-level output voltage ( Vqd = Min, Iol = Max) 


0.3 0.6 


V 


Vqlx Low-level output voltage ( Vqq = Min, Iql = Max), (XA12-XA0) 


0.6t 


V 


|^ Three-state current ( Vqq = Max) 


±20 


HA 


|| Input current ( V| = Vss t0 V DD) 


± 10 


t'A 


||p Input current (Inputs with internal pullups) (see Note 12) 


- 400 20 


HA 


|| C Input current (X2/CLKIN),(V| = Vss to Vcc) 


±50 


HA 


Iqq Supply current (Vqd = Max, f x = Max) (see Note 13) 


200 ,600 


mA 


C| Input capacitance 


15* 


pF 


Cq Output capacitance 


20* 


pF 


Cx X2/CLKIN capacitance 


25* 


PF 



t Derived from characterization and not tested. 

* Derived by design but not tested. 

NOTES: 9. All nominal values are at Vdq = 5 V, T^ = 25°C. 

1 0. f x is the input clock frequency. 

1 1 . All input and output voltage levels are TTL compatible. 

12. Pins with internal pullup devices: INT0-INT3, MC/MP, RSV0-RSV10. 

13. Actual operating current will be less than this maximum value. This value was obtained under specially produced 
worst-case test conditions, which are not sustained during normal device operation. These conditions consist of 
continuous parallel writes of a checkerboard pattern to both primary and expansion buses at the maximum rate 
possible. See Calculation of TMS320C30 Power Dissipation Application Report. 
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PARAMETER MEASUREMENT INFORMATION 



Tester Pin v Lo ad 
Electronics 




Output 
O Under 
Test 



Where: IrjL = 2 m A (all outputs) 
Iqh = 300 nA (all outputs) 
V L oad = 2.15 V 
Cj = 80 pF typical load circuit capacitance. 



Figure 1. Test Load Circuit 



signal transition levels 

TTL-level outputs are driven to a minimum logic-high level of 2.4 volts and to a maximum logic-low level of 0.6 
volts. Output transition times are specified as follows. 

For a high-to-low transition on a TTL-compatible output signal, the level at which the output is said to be no longer 
high is 2 volts, and the level at which the output is said to be low is 1 volt. For a low-to-high transition, the level 
at which the output is said to be no longer low is 1 volt, and the level at which the output is said to be high is 2 
volts. 





Figure 2. TTL-Level Outputs 

Transition times for TTL-compatible inputs are specified as follows. For a high-to-low transition on an input 
signal, the level at which the input is said to be no longer high is 2.1 volts, and the level at which the input is said 
to be low is 0.8 volt. For a low-to-high transition on an input signal, the level at which the input is said to be no 
longer low is 0.8 volt, and the level at which the input is said to be high is 2.1 volts. 




Figure 3. TTL-Level Inputs 
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timing parameters for CLKIN, H1, and H3 (see Note 11) 


NO. 


PARAMETER 


SMJ320C30-28 


SMJ320C30-25 


UNIT 


MIN MAX 


MIN MAX 


1 


tf(Ci) CLKIN fall time 


5t 


5t 


ns 


2 


'w(CIL) CLKIN low pulse duration t c (ci) = min (see Note 15) 


12.5 


14 


ns 


3 


f w(CIH) CLKIN high pulse duration t c (ci) = min (see Note 15) 


12.5 


14 


ns 


4 


t r (ci) CLKIN rise time 


5t 


5t 


ns 


5 


f c(CI) CLKIN cycle time 


35 


40 


ns 


6 


tf( H ) H1/H3 fall time 


3 


4 


ns 


7 


'w(HL) H1/H3 low pulse duration (see Note 14) 


P-6 


P-6 


ns 


8 


'w(HH) H1/H3 high pulse duration (see Note 14) 


P-7 


P-7 


ns 


9 


t r (H) H1/H3 rise time 


4 


4 


ns 


9.1 


l d(HL-HH) Delay from H1(H3) low to H3(H1) high 


rjt 5 


0t 5 


ns 


10 


t c (H) H1/H3 cycle time 


70 606 


80 606 


ns 



t Derived by design but not tested. 

NOTES-: 11. All input and output voltages are TTL compatible. 

14. P-tc(CI) 

15. Rise and fall times, assuming a 35-65% duty cycle, are incorporated within this specification. See X2/CLKIN timing below. 



X2/CLKIN timing 



X2/CLKIN 



\4 5 ►! 



-(4-3-X | 



5V) 



H1/H3 timing 



H— 2— H 



H1 



y> 



L 8 ►) 



9.1 



H3 



M 



10- 



/ 



i 

I 9.1 

/r< 8 >J\ 

9 J 
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memc 










>ry read/write cycle timing (MSTRB = 0) 




NO. 


PARAMETER 


SMJ320C30-28 


SMJ320C30-25 


UNIT 


MIN MAX 


MIN MAX 


11 


td(H1 L-(M)SL) H1 !° w t° (M)STRB low 


0* 10 


0* 10 


ns 


12 


«d(H1L-(M)SH) H1 low to (M)STRB high 


0* 10 


0* 10 


ns 


13.1 


'd(HIH-RWL) H1 hightoR/Wlow 


0* 10 


0* 10 


ns 


13.2 


td(HIH-XRWL) H1 high to (X)R/W low 


0* 17 


0* 18 


ns 


14.1 


'd(HIL-A) H1 lowto Avalid 


0* 16 


0* 18 


ns 


14.2 


l d(H1L-XA) H1 lowto (X) A valid 


0* 12 


0* 14 


ns 


15.1 


>su(D)R D valid before H1 low (read) 


19 


19 


ns 


15.2 


>su(XD)R ( X ) D setu P before H1 low (read) 


20 


20 


ns 


16 


'h((X)D)R ( X ) D hold time after H1 low (read) 


ot 


ot 


ns 


17.1 


tsu(RDY) RDY setup before H1 high 


10 


10 


ns 


17.2 


•su(XRDY) XRDY setup before H1 high 


10 


12 


ns 


18 


•h((X)RDY) XRDY hold time after H1 high 








ns 


19 


td(H1H-(X)RWH) H1 high to (X)RAV high (write) 


12 


12 


ns 


20 


'v((X)D)W (X)D valid after H1 low (write) 


20 


20 


ns 


21 


l h((X)D)W ( X ) D nold time after H1 high (write) 








ns 


22.1 


'd(H1 H-A) H1 high to A valid on back-to-back write cycles (write) 


22 


22 


ns 


22.2 


l d(H1 H-XA) H1 high to XA valid on back-to-back write cycles (write) 


32 


32 


ns 


26 


l d(A-XRDY) (X)RDY delay from A valid 


8t 


8t 


ns 



t These 
* These 



values derived from characterization but not tested, 
values derived by design but not tested. 
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memory read cycle timing (MSTRB = 0) 



H3 



MSTRB 






(X)R/W 






~>i W— 14.1/14.2 



(X)A 



X 



X 



"X 1^-13 



15.1/15.2 ~X 15— 

26 _» y_ l-H K- 16 



(X)D 



17.1/17.2 ->| K— 



X 

o- 



(X)RDY 



^£ 



memory write cycle timing (MSTRB = 0) 

H3 
H1 



MSTRB 



(X)R/W 




I \-4 



-X k-| 13 



19 



f 



— X |4— 14.1/14.2 



(X)A X" 



* 



22.1/22.2 



(X)D 






X 



(X)RDY 



-V.k-18 i 

17.1/17.2— » |4- *~ 26 

si '/ 



^_z: 
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memc 












>ry read cycle timing (IOSTRB = 0) 




NO. 


PARAMETER 


SMJ320C30-28 


SMJ320C30-25 


UNIT 


MIN 


MAX 


MIN MAX 


27 


td(H1 H-IOSL) H1 high to IOSTRB low 


0* 


11 


0* 11 


ns 


28 


<d(H1 H-IOSH) H1 high to IOSTRB high 


0* 


10 


0* 10 


ns 


29 


'd(HIL-XRWH) H1 lowtoXR/Whigh 


0* 


10 


0* 12 


ns 


30 


td(H1L-XA) H1 low to XA valid 


0* 


11 


0* 13 


ns 


31 


'su(XD)R XD setup before H1 high 


15 


15 


ns 


32 


'h(XD)R XD nold ,ime after H1 ni 9 n 


ot 


ot 


ns 


33 


'su(XRDY) XRDY setup before H1 high 


10 


12 


ns 


34 


'h(XRDY) XRDY hold time after H1 high 








ns 



t These values derived from characterization but not tested. 
t These values derived by design but not tested. 



27 -n k 

I I 


/ 
\ 


"A ^~Y 

28 -M |4— | 


IOSTRB [ V 


— V i 


I 
-*l *- 29 




| -*| k- 35 


XR/W / 

— r 

-H |4— 30 




i \i 


Xi _J^_ 




x 


XD 


31 
33 


i-* k- 32 

-Q 

I -H K— 34 


XRDY 




\ / 
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memory write cycle timing (iOSTRB = 0) 



NO. 


PARAMETER 


SMJ320C30-28 


SMJ320C30-25 


UNIT 


MIN MAX 


MIN MAX 


35 


'd(HH-XRWL) 


H1 low to (X)R/Wlow 


Ot 15 


Ot 15 


ns 


36 


<v(XD)W 


XD valid after H1 high 


30 


30 


ns 


37 


'h(XDW 


XD hold time after H1 low 








ns 



t These values derived by design but not tested. 



H3 



H1 



IOSTRB 



XR/W 



XA 



XD- 



XRDY 



27-H k- 28 



35 



I r 



30 



X 



/ 



v^ 



z 



X 



29 



36 



37 



< 



> 



33" 



l-H W- 34 



Z 



E-16 



TEXAS t* 

Instruments 

POST OFFICE BOX 1443 • HOUSTON, TEXAS 77001 



SMJ320C30 
DIGITAL SIGNAL PROCESSOR 











SGUS014 — FEBRUARY 1991 


timing for XFO and XF1 when executing LDFI 


orLDII 






NO. 


PARAMETER 


SMJ320C30-28 


SMJ320C30-25 


UNIT 


MIN MAX 


MIN MAX 


38 


l d(H3H-XF0L) H 3 high to XFO low 


15 


15 


ns 


39 


( su(XF1) XH valid before H1 low 


15 


15 


ns 


40 


th(XFD XF1 hold time after H1 low 








ns 



H3 



H1 



MSTRB 
(X)R/W 

(X)A 

(X)D 

(X)RDY 

XFO Pin 
XF1 Pin 



Fetch 
LDFI or LDII | Decode 



Read 



Execute 



!v / 



38 



39 -X |4- 



+-* W- 40 



X 



X 



X 



X 



<z> 
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timing for XFO when executing a STFI or STII 



NO. 


PARAMETER 


SMJ320C30-28 


SMJ320C30-25 


UNIT 


MIN MAX 


MIN MAX 


41 


>d(H3H-XF0H) H3 high to XFO high 


20 


20 


ns 



Fetch 
STFI or STII | Decode 




MSTRB 

(X)R/W 

(X)A 

(X)D 

(X)RDY 

XFO Pin 



V 



7 



N f 



< 



41 



f 



yz 



y- 



timing for XFO and XF1 when executing a SIGI 








NO. 


PARAMETER 


SMJ320C30-28 


SMJ320C30-25 


UNIT 


MIN MAX 


MIN MAX 


41 


'd(H3H-XF0L) H3 high to XFO low 


15 


15 


ns 


42 


<d(H3H-XF0H) H3 high to XFO high 


20 


20 


ns 


43 


'sufXFI) XF1 valid before H1 low 


12 


12 


ns 


44 


l h(XF1) ^ F1 hold time after H1 low 








ns 



H3 



H1 



XFO 



XF1 



Fetch 
SIGI 



Decode 



Read 



Execute 



Y_/~~\_/T^^\ 
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timing for loading XF register when configured as an output pin 



NO. 


PARAMETER 


SMJ320C30-28 


SMJ320C30-25 


UNIT 


MIN MAX 


MIN MAX 


45 


<v(H3H-XR H 3 hi 9 h to XF valid 


20 


20 


ns 




Decode 



Read 



Execute 



AA_^A_A\^TA. 



H1 



■w~~\_^~\_^~\_/~x_/~ 



OUTXF Bit 



XFPin 




1 orO 



— * (4— 45 



X 



change of XF from output to input mode 








NO. 


PARAMETER 


SMJ320C30-28 


SMJ320C30-25 


UNIT 


MIN MAX 


MIN MAX 


46 


'h(H3H-XFOI) XF hold after H3 high . 


20 t 


20 t 


ns 


47 


'su(XF) XF setup before H1 low 


12 


12 


ns 


48 


l h(XF) Xfr hold time after H1 low 








ns 



t These values derived from characterization but not tested. 



H3 



H1 



IOXF Bit 



XFPin 



INXF Bit 



Synchronizer I Value on Pin I 




Data 
Seen 
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change of XF from input to output mode 


NO. 


PARAMETER 


SMJ320C30-28 


SMJ320C30-25 


UNIT 


MIN MAX 


MIN MAX 


49 


H3 high to XF switching from 
'd(H3H-XFIO) j n p U t t0 output 


20 


20 


ns 



Execution of 
Load of IOF 



H3 



H1 



IOXF Bit 



XFPIn 



49 



< 
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reset timing 


NO. 


PARAMETER 


SMJ320C30-28 


SMJ320C30-25 


UNIT 


MIN 


MAX 


MIN 


MAX 


50 


'su(RESET) 


Setup for RESET before CLKIN low 


10 


10 


ns 


51 


l d(CLKINH-H1H) 


CLKINhightoHl high 


5 


18 


5 


18 


ns 


52 


'd(CLKINH-HIL) 


CLKIN high to H1 low 


5 


18 


5 


18 


ns 


53 


tsufRESETH-HIL) 


Setup for RESET high before H1 low and after 
10 H1 clock cycles 


15t 


15t 


ns 


54 


td(CLKINH-H3L) 


CLKIN high to H3 low 


5 


18 


5 


18 


ns 


55 


td(CLKINH-H3H) 


CLKIN high to H3 high 


5 


18 


5 


18 


ns 


56 


tdis(HIH-XD) 


H1 high to (X)D three state 


20t 


20t 


ns 


57 


tdis(H3H-XA) 


H3 high to (X)A three state 


12t 


12t 


ns 


58 


l d(H3H-CONTROLH) 


H3 high to control signals high 


iot 


iot 


ns 


59 


td(HIH-IACKH) 


H1 high to JACK high 


12t 


12t 


ns 


60 


<dis(RESETL-ASYNCH) 


RESET low to asynchronously reset signals 
three state 


25t 


25t 


ns 



t These values derived from characterization but not tested. 



CLKIN 



RESET \ 
(see Notes 20, 21) v 



(X)D 
(see Note 16) 

(X)A 
(see Note 17) 



Control Signals 
(see Note 18) 



IACK 

Asynchronous 
Reset Signals 
(see Note 19) 




NOTES: 16. (X)D includes D31 -DO and XD31-XD0. _ 
17. X(A) includes A23-A0, XA12-XA0, and RAty. 



18. Control signals include STRB, MSTRB, and IOSTRB. 

1 9. Asynchronously reset signals include XF1 , XF0, CLKXO, DXO, FSXO, CLKRO, DRO, FSRO, CLKX1 , DX1 , FSX1 , CLKR1 , DR1 , FSR1 , 
TCLKO, and TCLK1 . 

20. RESET is an asynchronous input and can be asserted at any point during a clock cycle. If the specified timings are met, the exact 
sequence shown_will occurj_otherwise, an additional delay of one clock cycle may occur. 

21 . Note that the RAW and XRA/V outputs are placed in a high impedance state during reset and can be provided with a resistive pull-up, 
nominally 20 kQ, if undesirable spurious writes could be caused when these outputs go low. 
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INT3-INT0 response timing 



NO. 


PARAMETER 


SMJ320C30-28 


SMJ320C30-25 


UNIT 


MIN NOM MAX 


MIN NOM MAX 


61 


, su(INT) INT3-INT0 setup before H1 low 


15 


15 


ns 


62 


l w(INT) Interrupt pulse width to guarantee one 
(see Note 22) interrupt seen 


P 1.5P <2Pt 


P 1.5P <2Pt 


ns 



t These values derived from characterization but not tested. P = One H1 period. 

NOTES: 22. Interrupt pulse width must be at least 1 P wide to guarantee it will be seen. It must be less than 2 P wide to guarantee it will be responded 

to only once. The recommended pulse width is 1 .5 P. 
23. INT is an asynchronous input and can be asserted at any point during a clock cycle. If the specified timings are met, the exact sequence 

shown will occur; otherwise, an additional delay of one clock cycle may occur. 

Fetch First 

Instruction of 

Service Routine I 




ADDR 



Data 



o 



o 
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iming 










lACKt 




NO. 


PARAMETER 


SMJ320C30-28 


SMJ320C30-25 


UNIT 


MIN MAX 


MIN MAX 


63 


t<j(H1H-IACKL) 


H1 high to IACK low 


12 


12 


ns 


64 


•d(HIH-IACKH) 


H1 high to IACK high during first cycle of JACK instruction 
data read 


12 


12 


ns 



NOTE 24: The IACK output is active for the entire duration of the bus cycle and is therefore extended if the bus cycle utilizes wait states. 



Fetch IACK 
Instruction 



H3 



IACK 
Data Read 



H1 



IACK 



63 -W |4- 



/ 



64 



ADDR 



xzzx 



DATA 



o 
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serial port timing 


NO. 


PARAMETER 


SMJ320C30-28 


SMJ320C30-25 


UNIT 


MIN MAX 


MIN MAX 


65 


'd(HI-SCK) H1 hi 9 h ,0 internal CLKX/R 


17 


17 


ns 


66 


'c(SCK) CLKX/R cycle time 


CLKX/R ext 


t C (H)x2.5 


t c(H )x2.5 


ns 


CLKX/R int 


<c(H)*2 2 t c (H)x2 32 * 


l c(H)><2 2<c(H)*2 32 t 


67 


CLKX/R high/low 
l w(SCK) pu | se width 


CLKX/R ext 


tc(H)+15t 


tc(H)+15+ 


ns 


CLKX/R int 


[tc(SCK)/2]-15 [t c (SCK)/2]+5 


Pc(SCK)/2J-15 [t C (SCK)/2]+5 


68 


'r(SCK). CLKX/R rise time 


8t 


8t 


ns 


69 


'f(SCK) CLKX/R fall time 


8t 


8t 


ns 


70 


'd(DX) CLKX to DX valid 


CLKX ext 


35 


35 


ns 


CLKX int 


20 


20 


71 


DR setup before 
tsu(DR) CLKX low 


CLKR ext 


10 


10 


ns 


CLKR int 


25 


25 


72 


DR hold from 
'h(DR) CLKRIow 


CLKR ext 


10 


10 


ns 


CLKR int 


ot 


ot 


73 


. CLKX to internal 
ki(FSX) FSX high/low 


CLKX ext 


32 


32 


ns 


CLKX int 


17 


17 


74 


, FSR setup before 
tsu(FSR) CLKR , ow 


CLKR ext 


10 


10 


ns 


CLKR int 


10 


10 


75 


th(FS) FSX/R input hold 


CLKX/R ext 


10 


10 


ns 


CLKX/R int 


ot 


ot 


76 


External FSX setup 
! su(FSX) before CLKX 


CLKX ext 


-[tc(H)-8] [t C (CLK)/2]-10* 


-Pc(H)-8] [tc(CLK)/2]-10* 


ns 


CLKX int 


-[tc(H)-21] t c(C LKX)/2* 


-I«c(H)-21] tc(CLKX)/2* 


77 


CLKX to first DX bit, 
td(CH-DX)V p SX precedes 
v ; CLKX high 


CLKX ext 


36 


36 


ns 


CLKX int 


21 


21 


78 


FSX to first DX bit, CLKX precedes 
td(FSX-DX)V FSX 


36 


36 


ns 


79 


CLKX high to DX high Z following 
>dDXZ | as t data bit 


20t 


20t 


ns 



t These values derived from characterization but not tested. 
* These values derived by design but not tested. 
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serial port timing, fixed data rate mode 



CLKX/R 



FSR 



FSX(INT) 




FSX(EXT) _/\ 



NOTES: 25. Timing diagrams show operations with CLKXP = CLKRP = FSXP = FSRP = 0. 

26. These timings are valid for all serial port modes, including handshake, except where otherwise indicated. 

serial port timing, variable data rate mode 

CLKX/R / \ 




H-72 



NOTES: 27. Timings are valid for all serial port modes, including handshake, except where otherwise indicated. 
25. Timing diagrams show operations with CLKXP = CLKRP = FSXP = FSRP = 0. 
28. Timings not expressly specified for variable data rate mode are the same as those for fixed data rate mode. 



Texas ^^ 
Instruments 

POST OFFICE BOX 1443 • HOUSTON, TEXAS 77001 



E-25 



SMJ320C30 

DIGITAL SIGNAL PROCESSOR 



SGUS01" 


— FEBRUARY 1991 
























HOLD/HOLDA timing 




NO. 


PARAMETER 


SMJ320C30-28 


SMJ320C30-25 


UNIT 


MIN 


MAX 


MIN 


MAX 


80 


'su(HOLD) HOLD valid before H1 low 


15 


15 


ns 


81 


'v(HOLDA) HOLDA valid after H1 low 


0* 


10 


0* 


10 


ns 


82 




2 


2 


H1 
cycles 


'w(HOLD) HOLD low width 


83 


'w(HOLDA) HOLDA low width 


tcH-St 


tcH-5t 


ns 


84 


'd(H1 L-SH)H H1 low to STRB high for a HOLD 


0* 


iot 


0* 


10t 


ns 


85 


tdisfHIL-S) H1 low to STRB high impedance state 


0* 


iot 


0* 


10t 


ns 


86 


'en(HIL-S) H1 low to STRB active 


0* 


10t 


0* 


10t 


ns 


87 


tdisfHIL-RW) H1 low to RAV high impedance state 


0* 


iot 


0* 


iot 


ns 


88 


'en(HIL-RW) H1 low to R/W active 


0* 


10t 


0* 


iot 


ns 


89 


l dis(H1L-A) H1 low ,0 address high impedance state 


0* 


15t 


ot 


15t 


ns 


90 


'en(HIL-A) H1 low to address valid 


0* 


15t 


0* 


15t 


ns 


91 


'dis(HIH-D) H1 high to data impedance state 


0* 


15t 


ot 


15t 


ns 



t These values derived from characterization but not tested. 
* These values derived by design but not tested. 

NOTE 29: HOLD is an asynchronous input and can be asserted at any point during a clock cycle. If the specified timings are met, the exact 
sequence shown will occur; otherwise, an additional delay of one clock cycle may occur. 
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peripheral pin general-purpose I/O timing 


NO. 


PARAMETER 


SMJ320C30-28 


SMJ320C30-25 


UNIT 


MIN MAX 


MIN MAX 


92 


tsu(GPIOHIL) 


General-purpose input setup time before H1 low 


15 


15 


ns 


93 


th(GPIOHIL) 


General-purpose input hold time after H1 low 








ns 


94 


td(GPIOHIH) 


General-purpose output delay after H1 low 


15 


15 


ns 



NOTE 30: Peripheral pins include CLKXO/1 , CLKRO/1 , DXO/1 , DRO/1 , FSXO/1 . FSRO/1 , and TCLKO/1 . The modes of these pins are defined by 
the contents of internal control registers associated with each peripheral. 




timer pin timing 



NO. 


PARAMETER 


SMJ320C30-28 


SMJ320C30-25 


UNIT 


MIN MAX 


MIN MAX 


78 


'su(TCLKH1L) 


TCLK setup before H1 low 


15 


15 


ns 


79 


thfTCLKHIL) 


TCLK hold after H1 low 








ns 


80 


'dfrCLKHIW 


TCLK valid after H1 high 


15 


15 


ns 



NOTE 31: Period and polarity of valid logic level are specified by contents of internal control registers. 



Peripheral V 
Pin /x 
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change of peripheral pin from general purpose output to input mode 



NO. 


PARAMETER 


SMJ320C30-28 


SMJ320C30-25 


UNIT 


MIN MAX 


MIN MAX 


98 


l h(H3H) 


Hold after H1 high 


15t 


15t 


ns 


99 


'su(GPIOHIL) 


Peripheral pin setup before H1 low 


15 


15 


ns 


100 


'WGPIOHIL) 


Peripheral pin hold after H1 low 








ns 



t These values derived from characterization but not tested. 



Value on Pin 

Seen In 

Perlphera 

Control Register 




Peripheral 

pin Output 



Data Bit 



Data 
Seen 
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change of peripheral pin from general-purpose input to output mode 



NO. 


PARAMETER 


SMJ320C30-28 


SMJ320C30-25 


UNIT 


MIN MAX 


MIN MAX 


101 


'd(GPIOHIH) H1 high to peripherial pin switching from input to output 


15 


15 


ns 



H3 



Execution of Store of 

Peripheral Control 

Register 



H1 



IO Control 
Bit 



Peripheral 
Pin 



101 



< 
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SMJ320C30 part order information 


DEVICE 


TECHNOLOGY 


POWER 
SUPPLY 


OPERATING 
FREQUENCY 


PACKAGE 
TYPE 


PROCESSING 
LEVEL 


SMJ320C30GBM25 


1.0-nm CMOS 


5V±10% 


25 MHz 


Ceramic 181 -pin PGA 


Class B 


SM320C30GBM25 


1.0-umCMOS 


5V* 10% 


25 MHz 


Ceramic 181 -pin PGA 


Std 


SMJ320C30GBM28 


1.0-um CMOS 


5Vt 5% 


28 MHz 


Ceramic 181 -pin PGA 


Class B 


SM320C30GBM28 


1.0-umCMOS 


5 V ± 5% 


28 MHz 


Ceramic 181 -pin PGA 


Std 


SMJ320C30HUM25 


1.0-um CMOS 


5 V ± 1 0% 


25 MHz 


Ceramic 196-pin gullwing leaded carrier 


Class B 


SM320C30HUM25 


1.0-um CMOS 


5 V±10% 


25 MHz 


Ceramic 196-pin gullwing leaded carrier 


Std 


SMJ320C30HUM28 


1.0-um CMOS 


5 V ± 5% 


28 MHz 


Ceramic 196-pin gullwing leaded carrier 


Class B 


SM320C30HUM28 


1.0-um CMOS 


5 V ± 5% 


28 MHz 


Ceramic 196-pin gullwing leaded carrier 


Std 


SMJ320C30HTM25 


1.0-um CMOS 


5V±10% 


25 MHz 


Ceramic 196-pin quad flatpack 


Class B 


SM320C30HTM25 


1.0-umCMOS 


5 V ± 1 0% 


25 MHz 


Ceramic 196-pin quad flatpack 


Std 


SMJ320C30HTM28 


1.0-umCMOS 


5Vt 5% 


28 MHz 


Ceramic 196-pin quad flatpack 


Class B 


SM320C30HTM28 


1.0-umCMOS 


5 V t 5% 


28 MHz 


Ceramic 196-pin quad flatpack 


Std 



device nomenclature 



SMJ 



320 



30 



GB 



28 



Prefix 

SMJ = Class B 
SM = STD 



Device Family 



320 = SMJ320 family 



Technology 

C = CMOS 

E = CMOSEPROM 

No letter = NMOS 



Device 



Speed Range 

25 = 25 MHz 
28 = 28 MHz 

Temperature Range 

M = - 55"C to 1 25°C 
L = 0°Cto 70°C 



Package Type 

GB = pin grid array (PGA) 

HT = 196 pin quad flatpack — flat lead 

HU = 196 pin quad flatpack — gullwing 
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HT 196-lead ceramic quad flatpack 



6,35 (0.250) Min Typ 



Pin 1 Indicator 



I 



0,20 (0.008) 



0,10(0.004) 
0,36(0.014) _ 
0,05 (0.002) 

(At Braze Pads) 



48,6(1.915) 
47,8(1.880) 

34,7(1.365) 



Typ 



33,9(1.335) 
30,5 (1.200) Ref. Typ 




0,25 (0.010) I 
0,15 (0.006) 



<t<t 



0,64 (0.025) Typ 



Jl 



0,25(0.010) Max 



Detail "A" 



Thermal Resistance Characteristics 



2,67 (0.105) Max 



Parameter 


°c/w 


R9jc 
R0JA 


1.0 
28.5 



3,30 (0.130) Max 



Detail "A" 

ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 
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HU 196-lead ceramic leaded chip carrier (gullwing) 

39,5 (1.555) 



2,55 (0.100) Typ 
Pin 1 Indicator 



0,64 (0.025) Mln 





0,64 (0.025) Typ 



Detail "A" 



0,76 (0.030) 
r~ 0,51 (0.020) 



Thermal Resistance Characteristics 



2,67 (0.105) Max 
0,20 (0.008) 



0,10(0.004) 
0,36 (0.014) 
0,05(0.002) 

(At Braze Pads) 



Parameter 


°C/W 


R0JC 
R0JA 


1.0 
28.5 



3,30 (0.130) Max 



Detail "A" 



ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 
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GB 181-pin ceramic pin grid array 



Top View 



Pin A1 
Corner Indicator 




Thermal Resistance Characteristics 



Parameter 


°c/w 


Air Flow 
LFPM 


R0JC 


3.8 


N/A 


R0JA 
R©JA 


21.6 
13.3 




200 


R©JA 
R©JA 


9.7 
8.3 


400 
600 


RQja 
RQja 


7.4 
7.1 


800 
1000 



4,70(0.185) 

3,55 (0.140) 1 



37,6(1.480) 
Side View 



T^-tl 



3,56(0.140) 



3,05 (0.120) 



2,54 (0.100) Typ- 



0,508 (0.020) 

0,406(0.016) 

(180 Places) 

Typ 



1,40(0.055) 
I I 1,14(0.045) 

-tt ; t ' 

— H k- 1,27(0 



050) Norn 
Diameter 
(4 Places) 



<t <t 



35,6(1.400) 
Ref 



H H Bottom View 
®)®®®®®®®®®®®®®(® 



® ®®®®®®®®®®®®® ® 
® ®®®®®®®®®®®®® ® 
® ®®®®®®®®®®®®® ® 
®®®® ® ®®®® 
®®®® ®®®® 

®®®® ®®®® 

®®®@® ®®®®® 

®®®® Extra Pin ©®®® 
®®®® /~ ®®®® 
®®®®® ® ®®@® 
® ®®®®®®®®®®®®® ® 
® ®®®®®®®®®®®®® ® 
® ®®®®®®®®®®®®® ®- 
ll®®®®®®®®®®®®®# 



r 



2,54 (0.100) Typ 



Index Corner 



_/* 1. 2 3 4 5 6 7 8 "9 10 11 12 1314 15 ' 

ALL LINEAR DIMENSIONS ARE IN MILLIMETERS AND PARENTHETICALLY IN INCHES 
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Architectural Overview 
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Program Flow Control 
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Quick Reference Guide 



Chapter F 



Quick Reference 



This appendix is a collection of the most-referenced material in this user's 
guide. 

Figure F-1 TMS320C3x Block Diagram F-4 

Figure F-2 TMS320C3x Block Diagram F-5 

Figure F-3 Extended-Precision Register Floating-Point Format F-7 

Figure F-4 Extended-Precision Register Integer Format F-7 

Figure F-5 Data-Page Pointer (DP) Register Format F-7 

Figure F-6 Index Register (IRx) Format F-7 

Figure F-7 Block-Size (BK) Register Format F-7 

Figure F-8 Status Register F-8 

Figure F-9 CPU/DMA Interrupt Enable Register (IE) F-9 

Figure F-1 CPU Interrupt Flag (IF) Register Format F-1 

Figure F-1 1 IOF Register Format F-1 1 

Figure F-1 2 TMS320C30 Memory Map F-1 2 

Figure F-1 3 TMS320C31 Memory Map F-1 3 

Figure F-1 4 Reset, Interrupt, and Trap Vector Locations F-1 4 

Figure F-1 5 Reset, Interrupt, and Trap Vector Format F-1 5 

Figure F-1 6 Peripheral-Bus Memory-Map Registers F-1 6 

Figure F-1 7 Memory-Mapped Locations for a DMA Channel F-1 7 

Figure F-1 8 DMA Global-Control Register Format F-1 8 

Figure F-1 9 Memory-Mapped Timer Locations F-20 

Figure F-20 Timer Global-Control Register F-21 

Figure F-21 Memory-Mapped Serial-Port Locations F-23 

Figure F-22 Serial-Port Global-Control Register Format F-24 

Figure F-23 FSX/DX/CLKX Port Control Register Format F-27 

Figure F-24 FSR/DR/CLKR Port Control Register Format ..- F-28 

Figure F-25 Receive/Transmit Timer Control Register Format F-29 

Figure F-26 Memory-Mapped External Interface Control Registers F-31 

Figure F-27 Primary-Bus Control Register Format F-32 

Figure F-28 Expansion-Bus Control Register Format F-33 

Table F-1 Feature Set Comparison F-2 

Table F-2 TMS320C31 Reserved Memory Locations F-3 

Table F-3 CPU Register/Assembler Syntax and Function F-6 

Table F-4 BNKCMP and Bank Size F-34 

Table F-5 Indirect Addressing F-35 

Table F-6 Instruction Set Summary F-37 

Table F-7 Parallel Instruction Set Summary F-42 
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F.1 TMS320C30 and TMS320C31 Differences 



This section addresses the major memory access differences between the 
TMS320C31 and the TMS320C30 devices. Observance of these consider- 
ations is critical for achieving design goal success. Table F-1 shows these dif- 
ferences. 



Table F-1. Feature Set Comparison 



Feature 


Device 


TMS320C31 


TMS320C30 


Data/program bus 


Primary bus: one bus composed 
of a 32-bit data and a 24-bit ad- 
dress bus 


Two buses: 

1) Primary bus: a 32-bit data and a 
24-bit address 

2) Expansion bus: a 32-bit data and a 
1 3-bit address 


Serial I/O ports 


1 serial port (SPO) 


2 serial ports (SPO, SP1 ) 


User program/data ROM 


Not available 


4K words/1 6K bytes 


Program boot loader 


User selectable 


Not available 



F.1.1 Data/Program Bus Differences 



The TMS320C31 uses only the primary bus and reserves the memory space 
that was previously used for expansion bus operations. 



Be careful! Program access to a reserved area produces unpredict- 
able results. 



F.1 .2 Serial Port Differences 



Serial port 1 references in Section 8.2 of the TMS320C3x User's Guide are not 
applicable to the TMS320C31 . The memory locations identified for the asso- 
ciated control registers and buffers are reserved. 



F.1 .3 Reserved Memory Locations 



Table F-2 identifies TMS320C31 reserved memory locations in addition to 
those shown in Table 3-8. 
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TMS320C30and TMS320C31 Differences 



Table F-2. TMS320C31 Reserved Memory Locations 



Feature 


Device 


TMS320C31 


TMS320C30 


OxOOOOOO-OxOOOFFF 


Reservedt 


Microcomputer program/data ROM modet 


0x800000-0x801 FFF 


Reserved 


Expansion bus MSTRB space 


0x804000-0x805FFF 


Reserved 


Expansion bus iOSTRB space 


0x808050 


Reserved 


SP1 global-control register 


0x808052-0x808056 


Reserved 


SP1 local-control registers 


0x808058 


Reserved 


SP1 data-transmit buffer 


0X80805C 


Reserved 


SP1 receive-transmit buffer 


0x808060 


Reserved 


Expansion bus control register 



t Applies to the MCBL and MC modes only. 



F.1 .4 Effects on the IF and IE Interrupt Registers 



The bits associated with serial port 1 in the IE (interrupt enable) register and 
the IF (interrupt flag) register for the TMS320C30 are not applicable to the 
TMS320C31 . Write only logic data to IE register bits 6, 7, 22, and 23 and to 
IF register bits 6 and 7. Writing logic 1s to these bits produces unpredictable 
results. 



F.1 .5 User Program/Data ROM 



The user program/data ROM that is available for the TMS320C30 device does 
not existforthe TMS320C31 . Rather, the memory locations that were allocated 
to support user program/data ROM operations have been reserved on the 
TMS320C31 to support microcomputer/boot loader accessing. See Chap- 
ter 3 for more information onlising the microcomputer/boot loader function. 



F.1. 6 Development Considerations 



For users who are developing application code using aTMS320C3x simulator, 
XDS, or ASM/LNK, Tl recommends that you modify the .cfm and .cmdfiles by 
removing these memory spaces from the tool's configured memory. This en- 
sures that your developed application performs as expected when the 
TMS320C31 device is used. 
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F.2 TMS320C3x Architecture 



Figure F-1 and Figure F-2 show the TMS320C3x's register-based CPU archi- 
tecture with internal and external buses, peripherals, DMA, and memory orga- 
nization. 



Figure F-1. TMS320C3x Block Diagram 
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Figure F-2. TMS320C3x Block Diagram 
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F.3 CPU Register File 



The TMS320C30 provides 28 registers in a multiport register file that is tightly 
coupled to the CPU. The PC is not included in the 28 registers. All of these reg- 
isters can be operated upon by the multiplier and ALU and can be used as gen- 
eral-purpose 32-bit registers. 



F.3.1 Register Addressing 



The TMS320C30 provides 28 registers in a multiport register file that is tightly 
coupled to the CPU. The PC is not included in the 28 registers. All of these reg- 
isters can be operated upon by the multiplier and ALU and can be used as gen- 
eral-purpose 32-bit registers. 



Table F-3. CPU Register/Assembler Syntax and Function 



CPU Register 
Address 


Assembler 
Syntax 


Assigned 
Function 


OOh 
01 h 
02h 
03h 
04h 
05h 
06h 
07h 


R0 
R1 
R2 
R3 
R4 
R5 
R6 
R7 


Extended-precision register 
Extended-precision register 
Extended-precision register 
Extended-precision register 
Extended-precision register 
Extended-precision register 
Extended-precision register 
Extended-precision register 


08h 
09h 
OAh 
OBh 
OCh 
ODh 
OEh 
OFH 


ARO 
AR1 
AR2 
AR3 
AR4 
AR5 
AR6 
AR7 


Auxiliary register 
Auxiliary register 
Auxiliary register 
Auxiliary register 
Auxiliary register 
Auxiliary register 
Auxiliary register 
Auxiliary register 


10h 
11h 
12h 
13h 
14h 


DP 
IRO 
IR1 
BK 
SP 


Data-page pointer 
Index register 
Index register 1 
Block-size register 
Active stack pointer 


15h 
16h 
17h 
18h 


ST 
IE 
IF 
IOF 


Status register 
CPU/DMA interrupt enable 
CPU interrupt flags 
I/O flags 


19h 
1Ah 
1Bh 


RS 
RE 
RC 


Repeat start address 
Repeat end address 
Repeat counter 
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CPU Register File 



Figure F-3. Extended-Precision Register Floating-Point Format 
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32 31 30 



e 


s 
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mantissa • 



Figure F-4. Extended-Precision Register integer Format 



39 




32 31 







unchanged 


signed or unsigned integer 



The eight 32-bit auxiliary registers (ARO — AR7) can be modified by the two 
Auxiliary Register Arithmetic Units (ARAUs). The primary function of the auxil- 
iary registers is the generation of 24-bit addresses, especially for use in indirect 
addressing. 



Figure F-5. Data-Page Pointer (DP) Register Format 



Figure F-6. Index Register (IRx) Format 



31 






8 


7 







X 


X ... X 


X 


page 


t 
31 


24 23 









X 


X 


index 



Figure F-7. Block-Size (BK) Register Format 
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Figure F-8. Status Register 
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XX 


15 


14 


13 


12 


11 


10 


9 


8 
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6 


5 


4 


3 
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XX 


XX 


GIE 


CC 


c. 


c F 


XX 


RM 


OVM 
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LV 


UF 


N 


Z 


V 


C 



NOTE: 



F!/W R/W R/W R/W 



xx = reserved bit. 
R = read, W = write. 



R/W R/W R/W R/W R/W R/W R/W R/W R/W 



Status Register Bits Summary 



Bit 


Name 


Reset Value 


Function 


ot 


C 





Carry flag. 


It 


V 





Overflow flag. 


2t 


Z 





Zero flag. 


3t 


N 





Negative flag. 


4t 


UF 





Floating-point underflow flag. 


5t 


LV 





Latched overflow flag. 


6t 


LUF 





Latched floating-point underflow flag. 


7 


OVM 





Overflow mode flag. This flag affects only the integer operations. If OVM 
= 0, the overflow mode is turned off; integer results that overflow are 
treated in no special way. If OVM = 1 , 

a) integer results overflowing in the positive direction are set to the 
most positive 32-bit twos-complement number (7FFFFFFFh) 

b) integer results overflowing in the negative direction are set to the 
most negative 32-bit twos-complement number (80000000h). 

Note that the function of V and LV is independent of the setting of OVM. 


8 


RM 





Repeat mode flag. If RM = 1 , the PC is being modified in either the 
repeat-block or repeat-single mode. 


9 


Reserved 





Read as 0. 


10 


CF 





Cache freeze. When CF = 1 , the cache is frozen. If the cache is enabled 
(CE = 1 ), fetches from the cache are allowed, but no modification of the 
state of the cache is performed. This function can be used to save fre- 
quently used code resident in the cache. At reset, is written to this bit. 
Cache clearing (CC=1 ) is allowed when CF=0. 


11 


CE 





Cache enable. CE = 1 enables the cache, allowing the cache to be used 
according to the least recently used (LRU) cache algorithm. CE = dis- 
ables the cache; no update or modification of the cache can be per- 
formed. No fetches are made from the cache. This function is useful for 
system debug. At system reset, is written to this bit. Cache clearing 
(CC = 1 ) is allowed when CE=0. 


12 


CC 





Cache clear. CC = 1 invalidates all entries in the cache. This bit is always 
cleared after it is written to and thus always read as 0. At reset, is written 
to this bit. 


13 


GIE 





Global interrupt enable. If GIE = 1 , the CPU responds to an enabled inter- 
rupt. If GIE = 0, the CPU does not respond to an enabled interrupt. 


15 — 14 


Reserved 





Read as 0. 


31—16 


Reserved 


0-0 


Value undefined. 



t The seven condition flags (ST bits 6 — 0) are defined in Section 1 0.2 on page 1 0-9. 
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Figure F-9. CPU/DMA Interrupt Enable Register (IE) 



CPU Register File 
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NOTE: 



xx = reserved bit, read as 0. 
R = read, W = write. 



IE Register Bits Summary 



Bit 


Name 


Reset Value 


Function 





EINTO 





Enable external interrupt (CPU) 


1 


. EINT1 





Enable external interrupt 1 (CPU) 


2 


EINT2 





Enable external interrupt 2 (CPU) 


3 


EINT3 





Enable external interrupt 3 (CPU) 


4 


EXINTO 





Enable serial-port transmit interrupt (CPU) 


5 


ERINTO 





Enable serial-port receive interrupt (CPU) 


6 


EXINT1 





Enable serial-port 1 transmit interrupt (CPU) 


7 


ERINT1 





Enable serial-port 1 receive interrupt(CPU) 


8 


ETINTO 





Enable timer interrupt (CPU) 


9 


ETINT1 





Enable timer 1 interrupt (CPU) 


10 


EDINT 





Enable DMA controller interrupt (CPU) 


15—11 


Reserved 





Value undefined 


16 


EINTO 





Enable external interrupt (DMA) 


17 


EINT1 





Enable external interrupt 1 (DMA) 


18 


EINT2 





Enable external interrupt 2 (DMA) 


19 


EINT3 





Enable external interrupt 3 (DMA) 


20 


EXINTO 





Enable serial-port transmit interrupt (DMA) 


21 


ERINTO 





Enable serial-port receive interrupt (DMA) 


22 


EXINT1 





Enable serial-port 1 transmit interrupt (DMA) 


23 


ERINT1 


o 


Enable serial-port 1 receive interrupt (DMA) 


24 


ETINTO 





Enable timer interrupt (DMA) 


25 


ETINT1 





Enable timer 1 interrupt (DMA) 


26 


EDINT 





Enable DMA controller interrupt (DMA) 


31—27 


Reserved 


0-0 


Value undefined 
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Figure F-10. CPU Interrupt Flag Register (IF) 

31 30 29 28 27 26 



| XX | XX | XX | XX | XX | XX j XX | XX | XX | XX | XX | XX | xx I XX j XX | xx I 



15 14 13 12 11 



| xx | DINT | TINT1 | TINTO | RINT1 j XINT1 | RINTO | XINTO | INT3 | INT2 \ INT1 \ INTO*] 



I XX I XX I XX I XX 



NOTE: 



xx = reserved bit, read as 0. 
R = read, W = write. 



R/W 



R/W R/W HAN HAN HAN 



IF Register Bits Summary 



Bit 


Name 


Reset Value 


Function 





INTO 





External interrupt flag 


1 


INT1 





External interrupt 1 flag 


2 


INT2 





External interrupt 2 flag 


3 


INT3 





External interrupt 3 flag 


4 


XINTO 





Serial-port transmit interrupt flag 


5 


RINTO 





Serial-port receive interrupt flag 


6 


XINT1T 





Serial-port 1 transmit interrupt flag 


7 


RINT1T 





Serial-port 1 receive interrupt flag 


8 


TINTO 





Timer interrupt flag 


9 


TINT1 





Timer 1 interrupt flag 


10 


DINT 





DMA channel interrupt flag 


31—11 


Reserved 


0-0 


Value undefined 



t Reserved on TMS320C31. 
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CPU Register File 



Figure F-11. I/O Flag Register (IOF) 
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xx = reserved bit, read as 0. 
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R/W 



R/W 
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R/W 



IOF Register Bits Summary 



Bit 


Name 


Reset Value 


Function 





Reserved 





Read as 0. 


1 


I/OXFO 





If I/OXFO = 0, XF0 is configured as a general-purpose input pin. 
If I/OXFO = 1 , XF0 is configured as a general-purpose output pin. 


2 


OUTXFO 





Data output on XF0. 


3 


INXFO 





Data input on XF0. A write has no effect. 


4 


Reserved 





Read as 0. 


5 


I/OXF1 





If I/OXF1 = 0, XF1 is configured as a general-purpose input pin. 
If I/OXF1 = 1 , XF1 is configured as a general-purpose output pin. 


6 


OUTXF1 





Data output on XF1 . 


7 


INXF1 





Data input on XF1 . A write has no effect. 


31—8 


Reserved 


0-0 


Read as 0. 
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F.4 Memory Maps 



The TMS320C3x memory map is divided into the following sections: program 
interrupt address, internal ROM, RAM, the peripheral bus, and memory- 
mapped peripheral registers. 



Figure F-1 2. TMS320C30 Memory Maps 
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(a) Microprocessor Mode 



(b) Microcomputer Mode 
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Figure F-13. TMS320C31 Memory Maps 
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(a) Microprocessor Mode 



(b) Microcomputer/Boot Loader Mode 
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F.4.1 Interrupts 

Figure F-14. Reset, Interrupt, and Trap Vector Locations 
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t Reserved on TMS320C31 
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Figure F-15. Reset, Interrupt, and Trap Vector Format 
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Reset and Interrupt Vector Locations 



Reset or 
Interrupt 


Vector 
Location 


Priority 


Function 


RESET 


Oh 





External reset signal input on the RESET 
pin. 


INTO 


1h 


1 


External interrupt input on the INTO pin. 


INTl 


2h 


2 


External interrupt input on the INT1 pin. 


INT2 


3h 


3 


External interrupt input on the 1NT2 pin. 


INT3 


4h 


4 


External interrupt input on the INT3 pin. 


XINTO 


5h 


5 


Internal interrupt generated when serial port 
transmit buffer is empty. 


RINTO 


6h 


6 


Internal interrupt generated when serial port 
receive buffer is full. 


XINTlt 


7h 


7 


Internal interrupt generated when serial port 
1 transmit buffer is empty. 


RINT1T 


8h 


8 


Internal interrupt generated when serial port 
1 receive buffer is full. 


TINTO 


9h 


9 


Internal interrupt generated by timer 0. 


TINT1 


OAh 


10 


Internal interrupt generated by timer 1 . 


DINT 


OBh 


11 


Internal interrupt generated by DMA control- 
ler 0. 



t Reserved on TMS320C31 . 
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F.4.2 Peripheral Bus 



Figure F-16. Peripheral-Bus Memory-Map Registers 
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808070h 
8097FFh 


Reserved 



t Reserved on TMS320C31 
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F.4.2.1 DMA Registers 

Figure F- 1 7. Memory-Mapped Locations for a DMA Channel 



Memory Maps 



Register 



DMA Global Control (See Table 8-7) 



Reserved 



Reserved 



Reserved 



DMA Source Address (subsection 8.3.2) 



Reserved 



DMA Destination Address (subsection 8.3.2) 



Reserved 



DMA Transfer Counter (subsection 8.3.3) 



Reserved 



Reserved 



Reserved 



Reserved 



Reserved 



Reserved 



Reserved 



Peripheral 
Address 

808000h 

808001 h 

808002h 

808003h 

808004h 

808005h 

808006h 

808007h 

808008h 

808009h 

80800Ah 

80800Bh 

80800Ch 

80800Dh 

80800Eh 

80800Fh 
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Figure F-18. DMA Global-Control Register Format 



31 


30 


29 


28 




27 


26 


25 


24 




23 


22 




21 




20 


19 




18 


17 


16 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


15 14 13 12 


11 


10 9 8 7 


6 5 4 


3 2 1 


XX 


XX 


XX 


XX 


TCINT 


TC 


SYNC 


DECDST 


INCDST 


DECSRC 


INCSRC 


STAT 


START 



R/W R/W R/W R/W 



R/W 



RAV 



R/W 



RA/V 



R RA/V RAV 



NOTE: xx = Reserved bit, read as 0. 
R = read, W = write. 



DMA Global-Control Register Bits Summary 



Bit 


Name 


Reset Value 


Function 


1 —0 


START 


0-0 


These bits control the state in which the DMA starts and stops. The 
DMA may be stopped without any loss of data. 


3 — 2 


STAT 


0-0 


These bits indicate the status of the DMA and change every cycle. 


4 


INCSRC 





If INCSRC = 1 , the source address is incremented after every read. 


5 


DECSRC 





If DECSRC = 1, the source address is decremented after every 
read. If INCSRC = DECSRC, the source address is not modified af- 
ter a read. 


6 


INCDST 





If INCDST = 1 , the destination address is incremented after every 
write. 


7 


DECDST 





If DECDST = 1 , the destination address is decremented after every 
write. If INCDST = DECDST, the destination address is not modified 
after a write. 


9 — 8 


SYNC 


0-0 


The SYNC bits determine the timing synchronization between the 
events initiating the source and the destination transfers. The inter- 
pretation of the SYNC bits is shown on next page. 


10 


TC 





The TC bit affects the operation of the transfer counter. If TC = 0, 
transfers are not terminated when the transfer counter becomes 
zero. If TC = 1 , transfers are terminated when the transfer counter 
becomes zero. 


11 


TCINT 





If TCINT = 1 , the DMA interrupt is set when the transfer counter 
makes a transition to zero. If TCINT = 0, the DMA interrupt is not set 
when the transfer counter makes a transition to zero. 


31—12 


Reserved 


0-0 


Read as zero. 



F-18 



Quick Reference 



Memory Maps 



DMA Global-Control Register Bits Summary (Concluded) 



START Bits and Operation of the DMA (Bits 0-1) 



START 


Function 


00 


DMA read or write cycles in progress will be completed; any data read will be ig- 
nored. Any pending read or write will be canceled. The DMA is reset so that when 
it starts, a new transaction begins; i.e., a read is performed. (Reset value) 


01 


If a read or write has begun, it is completed before it stops: for example, in the 
middle or at the end of a DMA transfer. If a read or write has not begun, no read 
or write is started. 


1 


If a DMA transfer has begun, the entire transfer is completed (including both read 
and write operations) before stopping. If a transfer has not begun, none is started. 


1 1 


DMA starts from reset or restarts from the previous state. 


STAT Bits and Status of the DMA (Bits 2-3) 


STAT 


Function 


00 


DMA is being held between DMA transfer (between a write and read). This is the 
value at reset. (Reset value) 


01 


DMA is being held in the middle of a DMA transfer, i.e., between a read and a write. 


1 


Reserved. 


1 1 


DMA busy; i.e., DMA is performing a read or write. 


SYNC Bits and Synchronization of the DMA (Bits 8-9) 


SYNC 


Function 


00 


No synchronization. Enabled interrupts are ignored. (Reset value) 


01 


Source synchronization. A read is performed when an enabled interrupt occurs. 


1 


Destination synchronization. A write is performed when an enabled interrupt oc- 
curs. 


1 1 


Source and destination synchronization. A read is performed when an enabled in- 
terrupt occurs. A write is then performed when the next enabled interrupt occurs. 
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F.4.2.2 Peripheral Timers 

Figure F-19. Memory-Mapped Timer Locations 



Register 



Timer Global Control (See Table 8-1) 



Reserved 



Reserved 



Reserved 



Timer Counter (See subsection 8.1 .2) 



Reserved 



Reserved 



Reserved 



Timer Period (See subsection 8.1 .2) 



Reserved 



Reserved 



Reserved 



Reserved 



Reserved 



Reserved 



Reserved 



Peripheral Address 
Timer Timer 1 



808020h 


808030h 


808021 h 


808031 h 


808022h 


808032h 


808023h 


808033h 


808024h 


808034h 


808025h 


808035h 


808026h 


808036h 


808027h 


808037h 


808028h 


808038h 


808029h 


808039h 


80802Ah 


80803Ah 


80802Bh 


80803Bh 


80802Ch 


80803Ch 


80802Dh 


80803Dh 


80802Eh 


80803Eh 


80802Fh 


80803Fh 
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Figure F-20. Timer Global-Control Register 



Memory Maps 



31 


30 


29 


28 


27 


26 


25 


24 


23 


22 


21 


20 


19 


18 


17 


16 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


15 


14 


13 


12 


11 


10 


9 


8 


7 


6 


5 


4 


3 


2 


1 





XX 


XX 


XX 


XX 


TSTAT 


INV 


CLKSRC 


C/P 


HLD 


GO 


XX 


XX 


DATIN 


DATOUT 


I/O 


FUNC 



R/W R/W R/W 

NOTE: xx = reserved bit, read as 0. 

R = read, W = write. 



R/W R/W R/W 



R/W 



R/W RA/V 



Timer Global-Control Register Bits Summary 



Bits 


Name 


Reset Value 


Function 





FUNC 





FUNC controls the function of TCLK. If FUNC = 0, TCLK is configured 
as a general-purpose digital I/O port. If FUNC = 1 , TCLK is configured 
as a timer pin (see Figure 8-7 for a description of the relationship 
between FUNC and CLKSRC). 


1 


I/O 





If FUNC = and CLKSRC = 0, TCLK is configured as a general-pur- 
pose I/O pin. In this case, if I/O = 0, TCLK is configured as a general- 
purpose input pin. If I/O = 1 , TCLK is configured as a general-purpose 
output pin. 


2 


DATOUT 





DATOUT drives TCLK when the TMS320C3x is in I/O port mode. 
DATOUT can also be used as an input to the timer. 


3 


DATIN 


X 


Data input on TCLK or DATOUT. A write has no effect. 


5 — 4 


Reserved 


0-0 


Read as 0. 


6 


GO 





The GO bit resets and starts the timer counter. When GO = 1 and the 
timer is not held, the counter is zeroed and begins incrementing on the 
next rising edge of the timer input clock. The GO bit is cleared on the 
same rising edge. GO = has no effect on the timer. 


7 


HLD 





Counter hold signal. When this bit is zero, the counter is disabled and 
held in its current state. If the timer is driving TCLK, the state of TCLK 
is also held. The internal divide-by-two counter is also held so that the 
counter can continue where it left off when HLD is set to 1 . The timer 
registers can be read and modified while the timer is being held. 
RESET has priority over HLD. Table 8-2 shows the effect of writing 
to GO and HLD. 


8 


C/P 





Clock/Pulse mode control. When C/P = 1 , clock mode is chosen, and 
the signaling of the status flag and external output will have a 50 
percent duty cycle. When C/P = 0, the status flag and external output 
will be active for one H1 cycle during each timer period (see 
Figure 8-4). 


9 


CLKSRC 





Specifies the source of the timer clock. When CLKSRC = 1 , an internal 
clock with frequency equal to one-half the H1 frequency is used to in- 
crement the counter. The INV bit has no effect on the internal clock 
source. When CLKSRC = 0, an external signal from the TCLK pin can 
be used to increment the counter. The external clock is synchronized 
internally, thus allowing external asynchronous clock sources that do 
not exceed the specified maximum allowable external clock frequen- 
cy. This will be less than f(H1 )/2. (See Figure 8-7 for a description of 
the relationship between FUNC and CLKSRC). 
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Timer Global-Control Register Bits Summary (Continued) 



Bits 


Name 


Reset Value 


Function 


10 


INV 





Inverter control bit. If an external clock source is used and INV = 1 , the 
external clock is inverted as it goes into the counter. If the output of the 
pulse generator is routed to TCLK and INV = 1 , the output is inverted 
before it goes to TCLK (see Figure 8-1). If INV = 0, no inversion is 
performed on the input or output of the timer. The INV bit has no effect, 
regardless of its value, when TCLK is used in I/O port mode. 


11 


TSTAT 





This bit indicates the status of the timer. It tracks the output of the 
uninverted TCLK pin. This flag sets a CPU interrupt on a transition from 
to 1 . A write has no effect. 


31—12 


Reserved 


0-0 


Read as 0. 



The result of a write using specified values of the GO and HLD bits in the global 
control register is shown below. 

Result of a Write of Specified Values of GO and HLD 



GO 


HLD 


Result 








All timer operations are held. No reset is performed. (Reset value) 





1 


Timer proceeds from state before write. 


1 





All timer operations are held, including zeroing of the counter. The GO bit 
is not cleared until the timer is taken out of hold. 


1 


1 


Timer resets and starts. 
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F.4.3 Serial Port 



Figure F-21. Memory-Mapped Serial-Port Locations 



Memory Maps 



Register 



Serial-Port Global Control 



Reserved 



FSX/DX/CLKX Port Control 



FSR/DR/CLKR Port Control 



R/X Timer Control 



R/X Timer Counter 



R/X Timer Period 



Reserved 



Data Transmit 



Reserved 



Reserved 



Reserved 



Data Receive 



Reserved 



Reserved 



Reserved 



Peripheral Address 


Serial 
PortO 


Serial 
Portlt 


808040h 


808050h 


808041 h 


808051 h 


808042h 


808052h 


808043h 


808053h 


808044h 


808054h 


808045h 


808055h 


808046h 


808056h 


808047h 


808057h 


808048h 


808058h 


808049h 


808059h 


80804Ah 


80805Ah 


80804Bh 


80805Bh 


80804Ch 


80805Ch 


80804Dh 


80805Dh 


80804Eh 


80805Eh 


80804Fh 


80805Fh 



t Reserved locations on the TMS320C31 
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Figure F-22. Serial-Port Global-Control Register Format 



R/W R/W R/W R/W R/W R/W 

NOTE: xx = Reserved bit, read as 0. 
R = read, W = write. 



R/W 



R/W 



R/W R/W R/W R 



31 


30 


29 


28 


27 




26 


25 




24 




23 


22 


21 




20 




19 


18 


17 


16 


XX 


XX 


XX 


XX 


RRESET 


XRESET 


RINT 


RTINT 


XINT j XTINT 


RLEN 


XLEN 


FSRP 


FSXP 


R/W R/W R/W R/W R/W R/W 
15 14 13 12 11 10 9 8 7 


R/W R/W RA/V R/W R/W R/W 
6 5 4 3 2 1 


DRP 


DXP 


CLKRP 


CLKXP 


RFSM 


XFSM 


RVAREN 


XVAREN 


RCLK 
SRCE 


XCLK 
SRCE 


HS 


RSR 
FULL 


XSR 
EMPTY 


FSXOUT 


XRDY 


RRDY 



Serial-Port Global-Control Register Bits Summary 



Bit 


Name 


Reset Value 


Function 





RRDY 





If RRDY = 1 , the receive buffer has new data and is ready to be read. A three H1 /H3 
cycle delay occurs from the reading of DRR to RRD Y = 1 . The rising edge of this signal 
sets RINT. If RRDY= at reset, the receive buffer does not have new data since the 
last read. RRDY = at reset and after the receive buffer is read. 


1 


XRDY 


1 


If XRDY = 1 , the transmit buffer has written the last bit of data to the shifter and is ready 
for a new word. A three H1 /H3 cycle delay occurs from the loading of the transmit shift- 
er until XRDY is set to 1 . The rising edge of this signal sets XINT. If XRDY = O.the 
transmit buffer has not written the last bit of data to the transmit shifter and is not ready 
for a new word. XRDY = 1 at reset. 


2 


FSXOUT 





This bit configures the FSX pin as an input (FSXOUT = 0) or an outpu^FSXOUT = 1 ). 


3 


XSREMPTY 





If XSREMPTY = 0, the transmit shift register is empty. If XSREMPTY = 1 , the transmit 
shift register is not empty. Reset or XRESET causes this bit to = 0. 


4 


RSRFULL 





If RSRFULL = 1, an overrun of the receiver has occurred. In continuous mode, 
RSRFULL is set to 1 when both RSR and DRR are full. In noncontinuous mode, 
RSRFULL is set to 1 when RSR and DRR are full and a new FSR is received. A read 
causes this bit to be set to 0. This bit can be set to only by a system reset, a serial 
port receive reset (RRESET = 1 ), or a read. When the receiver tries to set RSRFULL 
to a 1 at the same time that the global register is read, the receiver will dominate and 
RSRFULL is set to 1 . If RSRFULL = 0, no overrun of the receiver has occurred. 


5 


HS 





If HS = 1 , the handshake mode is enabled. If HS = 0, the handshake mode is disabled. 


6 


XCLKSRCE 





If XCLKSRCE = 1 , the internal transmit clock is used. If XCLKSRCE = 0, the external 
transmit clock is used. 


7 


RCLKSRCE 





If RCLKSRCE = 1 , the internal receive clock is used. If RCLKSRCE = 0, the external 
receive clock is used. 


8 


XVAREN 





This bit specifies fixed (XVAREN = 0) or variable (XVAREN = 1) data rate signaling 
when transmitting. With a fixed data rate, FSX is active for at least one XCLK cycle 
and then goes inactive before transmission begins. With variable data rate, FSX is 
active while all bits are being transmitted. When you use an external FSX and variable 
data rate signaling, the DX pin is driven by the transmitter when FSX is held active 
or when a word is being shifted out. 


9 


RVAREN 





This bit specifies fixed (RVAREN = 0) or variable (RVAREN = 1 ) data rate signaling 
when receiving. With a fixed data rate, FSR is active for at least one RCLK cycle and 
then goes inactive before the reception begins. With variable data rate, FSR is active 
while all bits are being received. 
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Serial-Port Global-Control Register Bits Summary (Continued) 



Bit 


Name 


Reset Value 


Function 


10 


XFSM 





Transmitframe sync mode. Configures the port for continuous mode operation(XFSM 
= 1 ) or standard mode (XFSM = 0). In continuous mode, only the first word of a block 
generates a sync pulse, and the rest are simply transmitted continuously to the end 
of the block. In standard mode, each word has an associated sync pulse. 


11 


RFSM 





Receive frame sync mode. Configures the port for continuous mode (RFSM =1) or 
standard mode (RFSM = 0) operation. In continuous mode, only the first word of a 
block generates a sync pulse, and the rest are simply received continuously without 
expectation of another sync pulse. In standard mode, each word received has an 
associated sync pulse. 


12 


CLKXP 





CLKX polarity. If CLKXP = 0, CLKX is active high. If CLKXP = 1 , CLKX is active low. 


13 


CLKRP 





CLKR polarity. If CLKRP = 0, CLKR is active high. If CLKRP =1 , CLKR is active low. 


14 


DXP 





DX polarity. If DXP = 0, DX is active high. If DXP = 1 , DX is active low. 


15 


DRP 





DR polarity. If DRP = 0, DR is active high. If DRP = 1 , DR is active low. 


16 


FSXP 





FSX polarity. If FSXP = 0, FSX is active high. If FSXP = 1 , FSX is active low. 


17 


FSRP 





FSR polarity. If FSRP = 0, FSR is active high. If FSRP = 1 , FSR is active low. 


19 — 18 


XLEN 


00 


These two bits define the word length of serial data transmitted. All data is assumed 
to be right-justified in the transmit buffer when fewer than 32 bits are specified. 

0—8 bits 1 — 24 bits 

1 — 16 bits 1 1 — 32 bits 


21 —20 


RLEN 


00 


These two bits define the word length of serial data received. All data is right-justified 
in the receive buffer. 

0—8 bits 1 — 24 bits 

1—16 bits 1 1 — 32 bits 


22 


XTINT 





Transmit timer interrupt enable. If XTINT = 0, the transmit timer interrupt is disabled. 
If XTINT = 1 , the transmit timer interrupt is enabled. 


23 


XINT 





Transmit interrupt enable. If XINT = 0, the transmit interrupt is disabled. If XINT= 1 , 
the transmit interrupt is enabled. Note that the CPU transmit interrupt flag XINT is the 
logical OR of the enabled transmit timer interrupt and the enabled transmit interrupt. 


24 


RTINT 





Receive timer interrupt enable. If RTINT = 0, the receive timer interrupt is disabled. 
If RTINT = 1 , the receive timer interrupt is enabled. 


25 


RINT 





Receive interrupt enable. If RINT = 0, the receive interrupt is disabled. If RlNT= 1 , the 
receive interrupt is enabled. Note that the CPU receive interrupt flag RINT is the OR 
of the enabled receive timer interrupt and the enabled receive interrupt. 


26 


XRESET 





Transmit reset. If XRESET = 0, the transmit side of the serial port is reset. To take the 
transmit side of the serial port out of reset, set XRESET to 1 . However, do not set XRE- 
SET to 1 until at least three cycles after XRESET goes inactive. This applies only to 
system reset. Setting XRESET to does not change the contents of any of the serial- 
port controlregisters. It places the transmitter in a state corresponding to the begin- 
ning of a frame of data. Resetting the transmitter generates a transmit interrupt. Reset 
this bit during the time the mode of the transmitter is set. XFSM can be toggled without 
resetting the global-control register. 
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Serial-Port Global-Control Register Bits Summary (Concluded) 



Bit 


Name 


Reset Value 


Function 


27 


RRESET 





Receive reset. If RRESET = 0, the receive side of the serial port is reset. To take the 
receive side of the serial port out of reset, set RRESET to 1 . Setting RRESET to 
does not change the contents of any of the serial-port control registers. It places the 
receiver in a state corresponding to the beginning of a frame of data. Reset this bit 
at the same time the mode of the receiver is set. RFSM can be toggled without reset- 
ting the global-control register. 


31 —28 


Reserved 


0-0 


Read as 0. 
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F.4.4 FSX/DX/CLKX Port Control Register 

Figure F-23. FSX/DX/CLKX Port Control Register 



NOTE: 



31 


30 


29 


28 


27 


26 


25 


24 


23 


22 


21 


20 


19 


18 


17 


16 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


15 


14 


13 


12 


11 


10 


9 


8 


7 


6 


5 


4 


3 


2 


1 





XX 


XX 


XX 


XX 


FSX 
DATIN 


FSX 
DATOUT 


FSX 
I/O 


FSX 
FUNC 


DX 
DATIN 


DX 
DATOUT 


DX 

I/O 


DX 

FUNC 


CLKX 
DATIN 


CLKX 
DATOUT 


CLKX 

I/O 


CLKX 
FUNC 



R/W R/W 



xx = reserved bit, read as 0. 
R = read, W = write. 



FSX/DX/CLKX Port Control Register Bits Summary 



Bit 


Name 


Reset Value 


Function 





CLKXFUNC 





CLKXFUNC controls the function of CLKX. If CLKXFUNC = 0, CLKX is confi- 
gured as a general-purpose digital I/O port. If CLKXFUNC = 1 , CLKX is a 
serial port pin. 


1 


CLKXT/O 





]f CLKX I/O = 0, CLKX is configured as a general-purpose input pin. If CLKX 
1/0 = 1, CLKX is configured as a general-purpose output pin. 


2 


CLKX DATOUT 





Data output on CLKX. 


3 


CLKXDATIN 


X 


Data input on CLKX. A write has no effect. 


4 


DXFUNC 





DXFUNC controls the function of DX. If DXFUNC = 0, DX is configured as 
a general-purpose digital I/O port. If DXFUNC = 1 , DX is a serial port pin. 


5 


DXT/O 





If DX I/O = 0, DX is configured as a general-purpose input pin. 
If DX 1/0 = 1, DX is configured as a general-purpose output pin. 


6 


DXDATOUT 





Data output on DX. 


7 


DXDATIN 


X 


Data input on DX. A write has no effect. 


8 


FSXFUNC 





FSXFUNC controls the function of FSX. If FSXFUNC = 0, FSX is configured 
as a general-purpose digital I/O port. If FSXFUNC = 1 , FSX is a serial port 
pin. 


9 


FSX T/O 





If FSX I/O = 0, FSX is configured as a general-purpose input pin. 
If FSX 1/0 = 1, FSX is configured as a general-purpose output pin. 


10 


FSXDATOUT 





Data output on FSX. 


11 


FSXDATIN 


X 


Data input on FSX. A write has no effect. 


31 — 12 


Reserved 


0-0 


Read as 0. 
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F.4.5 FSR/DR/CLKR Port Control Register 

Figure F-24. FSR/DR/CLKR Port Control Register 



31 


30 


29 


28 


27 


26 


25 


24 


23 


22 


21 


20 


19 


18 


17 


16 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


15 


14 


13 


12 


11 


10 


9 


8 


7 


6 


5 


4 


3 


2 


1 





XX 


XX 


XX 


XX 


FSR 
DATIN 


FSR 
DATOUT 


FSR 
I/O 


FSR 
FUNC 


DR 
DATIN 


DR 
DATOUT 


DR 

I/O 


DR 
FUNC 


CLKR 
DATIN 


CLKR 
DATOUT 


CLKR 
I/O 


CLKR 
FUNC 



R/W 



R/W 



RAV 



NOTE: xx = reserved bit, read as 0. 

R = read, W = write. 



FSR/DR/CLKR Port Control Register Bits Summary 



Bit 


Name 


Reset Value 


Function 





CLKRFUNC 





CLKRFUNC controls the function of CLKR. If CLKRFUNC = 0, CLKR is 
configured as a general-purpose digital I/O port. If CLKRFUNC = 1, 
CLKR is a serial port pin. 


1 


CLKRl/O 





If CLKRI/O = 0, CLKR is configured as a general-purpose input pin. 
If CLKRI/O = 1 , CLKR is configured as a general-purpose output pin. 


2 


CLKRDATOUT 





Data output on CLKR. 


3 


CLKRDATIN 


X 


Data input on CLKR. A write has no effect. 


4 


DRFUNC 





DRFUNC controls the function of DR. If DRFUNC = 0, DR is configured 
as a general-purpose digital I/O port. If DRFUNC = 1 , DR is a serial port 
pin. 


5 


DRl/O 





If DRl/O = 0, DR is configured as a general-purpose input pin. 
If DRl/O = 1 , DR is configured as a general-purpose output pin. 


6 


DRDATOUT 





Data output on DR. 


7 


DRDATIN 


X 


Data input on DR. A. write has no effect. 


8 


FSRFUNC 





FSRFUNC controls the function of FSR. If FSRFUNC = 0, FSR is confi- 
gured as a general-purpose digital I/O port. If FSRFUNC = 1 , FSR is a 
serial port pin. 


9 


FSR I/O 





]f FSR I/O = 0, FSR is configured as a general-purpose input pin. If FSR 
I/O = 1 ,FSR is configured as a general-purpose output pin. 


10 


FSRDATOUT 





Data output on FSR. 


11 


FSRDATIN . 


X 


Data input on FSR. A write has no effect. 


31 —12 


Reserved 


0-0 


Read as 0. 
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F.4.6 Receive/Transmit Timer Control Register 

Figure F-25. Receive/Transmit Timer Control Register 



31 


30 


29 


28 


27 


26 


25 


24 


23 


22 


21 


20 


19 


18 


17 


16 


XX 


XX 




XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX | 


15 


14 


13 


12 


11 


10 


9 


8 


7 


6 


5 


4 


3 


2 


1 





XX 


XX 


XX 


XX 


RTSTAT 


XX 


RCLKRC 


RC/P 


RHLD 


RGO 


XTSTAT 


XX 


XCLKSRC 


XC/P 


XHLD 


XGO | 



R/W 



R/W 



R/W 



R/W 



NOTE: 



xx. = reserved bit, read as 0. 
R = read, W = write. 



Receive/Transmit Timer Control Register 



Bit 


Name 


Reset Value 


Function 





XGO 





The XGO bit resets and starts the transmit timer counter. When XGO 
is set to 1 and the timer is not held, the counter is zeroed and begins 
incrementing on the next rising edge of the timer input clock. The XGO 
bit is cleared on the same rising edge. Writing to XGO has no effect 
on the transmit timer. 


1 


XHLD 





Transmit counter hold signal. When this bit is set to 0, the counter is dis- 
abled and held in its current state. The internal divide-by-two counter 
is also held so that the counter will continue where it left off when XHLD 
is set to 1 . The timer registers may be read and modified while the timer 
is being held. RESET has priority over XHLD. 


2 


XC/P 





XCIock/Pulse mode control. When XC/P - 1,theclockmodeis chosen. 
The signaling of the status flag and external output has a 50-percent 
duty cycle. When XC/P = 0, the status flag and external output are active 
for one CLKOUT cycle during each timer period. 


3 


XCLKSRC 





This bit specifies the source of the transmit timer clock. When 
XCLKSRC = 1 , an internal clock with frequency equal to one-half the 
CLKOUT frequency is used to increment the counter. When XCLKSRC 
= 0, an external signal from the CLKX pin can be used to increment the 
counter. The external clock source is SYNChronized internally, thus al- 
lowing for external aSYNChronous clock sources that do not exceed 
the specified maximum allowable external clock frequency, i.e., less 
thanf(H1)/2.6. 


4 


Reserved 





Read as zero. 


5 


XTSTAT 





This bit indicates the status of the transmit timer. It tracks what would 
be the output of the uninverted CLKX pin. This flag sets a CPU interrupt 
on a transition from to 1 . A write has no effect. 
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Receive/Transmit Timer Control Register (Concluded) 



Bit 


Name 


Reset Value 


Function 


6 


RGO 





The RGO bit resets and starts the receive timer counter. When RGO 
is set to 1 and the timer is not held, the counter is zeroed and begins 
incrementing on the next rising edge of the timer input clock. The RGO 
bit is cleared on the same rising edge. Writing to RGO has no effect 
on the receive timer. 


7 


RHLD 





Receive counter hold signal . When this bit is set to 0, the counter is dis- 
abled and held in its current state. The internal divide-by-two counter 
is also held so that the counter will continue where it left off when RHLD 
is set to 1 . The timer registers may be read and modified while the timer 
is being held. RESET has priority over RHLD. 


8 


RC/P 





RCIock/Pulse mode control. When RC/P = 1 , the clock mode is chosen. 
The signaling of the sjatus flag and external output has a 50-percent 
duty cycle. When RC/P = 0, the status flag and external output are ac- 
tive for one CLKOUT cycle during each timer period. 


9 


RCLKSRC 





This bit specifies the source of the receive timer clock. When RCLKSRC 
= 1 , an internal clock with frequency equal to one-half the CLKOUT fre- 
quency is used to increment the counter. When RCLKSRC = 0, an ex- 
ternal signal from the CLKR pin can be used to increment the counter. 
The external clock source is SYNChronized internally, thus allowing for 
external aSYNChronous clock sources that do not exceed the specified 
maximum allowable external clock frequency, i.e., less than f(H1)/2.6. 


10 


Reserved 





Read as zero. 


11 


RTSTAT 





This bit indicates the status of the receive timer. It tracks what would be 
the output of the uninverted CLKR pin. This flag sets a CPU interrupt 
on a transition from to 1 . A write has no effect. 


31—12 


Reserved 


0-0 


Read as 0. 
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F.4.7 Primary-Bus and Expansion-Bus Control 

Figure F-26. Memory-Mapped External Interface Control Registers 

Register 



Expansion Bus Control (See subsection 7.1 .2)t 



Reserved 



Reserved 



Reserved 



Primary Bus Control (See subsection 7.1.1) 



Reserved 



Reserved 



Reserved 



Reserved 



Reserved 



Reserved 



Reserved 



Reserved 



Reserved 



Reserved 



Reserved 



t Reserved on the TMS320C31 



Peripheral 
Address 

808060h 

808061 h 

808062h 

808063h 

808064h 

808065h 

808066h 

808067h 

808068h 

808069h 

80806Ah 

80806Bh 

80806Ch 

80806Dh 

80806Eh 

80806Fh 



Memory Maps 



F-31 



Memory Maps 



F.4.8 Primary-Bus Control Register 

Figure F-27. Primary-Bus Control Register 



31 


30 


29 


28 


27 


26 


25 


24 


23 


22 


21 


20 


19 


18 


17 


16 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX XX 


XX XX 


15 


14 


13 


12 


11 10 9 


8 


7 6 5 


4 3 2 1 


XX 


XX 


XX 


BNKCMP 


WTCNT 


SWW 


HIZ NOHOLD HOLDST 



NOTE: 



R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W R/W 

xx = reserved bit, read as 0. 
R = read, W = write.. 

Primary-Bus Control Register Bits Summary 



Bit 


Name 


Reset 
Value 


Function 





HOLDST 


x 


Hold status bit. This bit signals whether the port is being held 
(HOLDST = 1) or is not being held (HOLDST = 0). This status bit is 
valid whether the port has been held via hardware or software. 


1 


NOHOLD 





Port hold siqnal. NOHOLD allows or disallows the port to be held by 
an external HOLD signal. When NOHOLD = 1 , the TMS320C3X takes 
over the external bus and controls it, regardless of serviced orpending 
requests by external devices. No hold acknowledge (HOLDA) is as- 
serted when a HOLD is received. However, it is asserted if an internal 
hold is generated (HIZ = 1 ). NOHOLD is set to at reset. 


2 


HIZ 





Internal hold. When set (HIZ = 1), the port is put in hold mode. This 
is equivalent to the external HOLD signal. By forcing a high-impe- 
dance condition, the TMS320C3x can relinquish the external memory 
port through software. HOLDA goes low when the port is placed in the 
high-impedance state. HIZ is set to at reset. 


4 — 3 


SWW 


11 


Software wait mode. In conjunction with WTCNT, this 2-bit field de- 
fines the mode of wait-state generation. It is set to 1 1 at reset. 


7 — 5 


WTCNT 


111 


Software wait mode. This 3-bit field specifies the number of cycles to 
use when in software wait mode for the generation of internal wait 
states. The range is zero (WTCNT = 0) to seven (WTCNT =111) 
H1/H3 cycles. It is set to 1 1 1 at reset. 


12—8 


BNKCMP 


10000 


Bank compare. This 5-bit field specifies the number of MSBs of the 
address to be used to define the bank size. It is set to 1 at reset. 


31 —13 


Reserved 


0-0 


Read as 0. 
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Memory Maps 



F.4.9 Expansion-Bus Control Register 

Figure F-28. Expansion-Bus Control Register 



31 


30 


29 


28 


27 


26 


25 


24 


23 


22 


21 


20 


19 


18 


17 


16 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


15 


14 


13 


12 


11 


10 


9 


8 


7 6 5 


4 3 


2 


1 





XX 


XX 


XX 


XX 


XX 


XX 


XX 


XX 


WTCNT 


SWW 


XX 


XX 


XX 



R/W R/W R/W R/W R/W 



NOTE: xx = reserved bit, read as 0. 
R = read, W = write. 



Expansion-Bus Control Register Bits Summary 



Bit 


Name 


Reset 
Value 


Function 


2 — 


Reserved 


000 


Read as 0. 


4—3 


SWW 


11 


Software wait-state generation. In conjunction with the WTCNT, 
this 2-bit field defines the mode of wait-state generation. It is set 
to 1 1 at reset. 


7 — 5 


WTCNT 


111 


Software wait mode. This 3-bit field specifies the number of cycles 
to use when in software wait mode for the generation of internal 
wait states. The range is zero (WTCNT = 0) to seven 
( WTCNT= 1 1 1) H1/H3 clock cycles. It is set to 1 1 1 at reset. 


31—8 


Reserved 


0-0 


Read as 0. 
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F.4.10 Programmable Bank Switching 

Table F-4. BNKCMP and Bank Size 



BNKCMP 


MSBs Defining a Bank 


Bank Size (32-Bit Words) 


00000 


None 


224 = 16M 


00001 


23 


223 = 8M 


00010 


23—22 


222= 4M 


00011 


23—21 


221 = 2M 


00100 


23—20 


220 =1M 


00101 


23—19 


2"I9=512K 


00110 


23—18 


2"I8 = 256K 


00111 


23—17 


2 17 =128K 


01000 


23—16 


2"I6 = 64K 


01001 


23—15 


2"I5 = 32K 


01010 


23—14 


2"I4 = 16K 


01011 


23—13 


213 =8 K 


01100 


23—22 


212 =4 K 


01101 


23—11 


2"11 = 2K 


01110 


23—12 


210 = 1K 


01111 


23—9 


2 9 =512 


10000 


23—8 


28 =256 


10000 —11111 


Reserved 


Undefined 
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F.5 Instruction Set 

F.5.1 Instruction Formats 

Table F-5. Indirect Addressing 



Mod Field 


Syntax 


Operation 


Description 


Indirect Addressing with Displacement 


00000 


*+ARn(disp) 


addr = ARn + disp 


With predisplacement add 


00001 


*- ARn(disp) 


addr = ARn - disp 


With predisplacement subtract 


00010 


*++ARn(disp) 


addr = ARn + disp 
ARn = ARn + disp 


With predisplacement add and modify 


00011 


*-- ARn(disp) 


addr = ARn - disp 
ARn = ARn -disp 


With predisplacement subtract and modify 


00100 


*ARn++(disp) 


addr = ARn 
ARn = ARn + disp 


With postdisplacement add and modify 


00101 


*ARn - - (disp) 


addr = ARn 
ARn = ARn -disp 


With postdisplacement subtract and modify 


00110 


*ARn++(disp)% 


addr = ARn 

ARn = circ(ARn + disp) 


With postdisplacement add and circular 
modify 


00111 


*ARn - - (disp)% 


add = ARn 

ARn = circ(ARn - disp) 


With postdisplacement subtract and 
circular modify 


Indirect Addressing with Index Register IRO 


01000 


*+ARn(IR0) 


addr = ARn + IRO 


With preindex (IRO) add 


01001 


*-ARn(IR0) 


addr = ARn -IRO 


With preindex (IRO) subtract 


01010 


*++ARn(IR0) 


addr = ARn + IRO 
ARn = ARn + IRO 


With preindex (IRO) add and modify 


01011 


*--ARn(IR0) 


addr = ARn -IRO 
ARn = ARn -IRO 


With preindex (IRO) subtract and modify 


01100 


*ARn++(IR0) 


addr = ARn 
ARn = ARn + IRO 


With postindex (IRO) add and modify 


01101 


'ARn--(IRO) 


addr= ARn 
ARn = ARn -IRO 


With postindex (IRO) subtract and modify 


01110 


*ARn++(IR0)% 


addr = ARn 

ARn = circ(ARn + IRO) 


With postindex (IRO) add and circular 
modify 


01111 


*ARn — (IR0)% 


addr = ARn 

ARn = circ(ARn)-IRO 


With postindex (IRO) subtract and circular 
modify 



LEGEND: 






addr 


= 


memory address 


ARn 


= 


auxiliary register ARO - AR7 


IRn 


= 


index register IRO or IR1 


disp 


= 


displacement 


++ 


= 


add and modify 


-- 


= 


subtract and modify 


circ( ) 


= 


address in circular addressing 


/o 


= 


where circular addressing is performed 
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Table F-5. Indirect Addressing (Concluded) 



Mod Field | Syntax 


Operation 


Description 


Indirect Addressing with Index Register IR1 


10000 


*+ARn(IR1) 


addr = ARn + IR1 


With preindex(IRI) add 


10001 


*-ARn(IR1) 


addr = ARn -IR1 


With preindex (IR1) subtract 


10010 


*++ARn(IRl) 


addr = ARn + IR1 
ARn = ARn + IR1 


With preindex(IRI) add 
and modify 


10011 


* — ARn(IR1) 


addr = ARn-IR1 
ARn = ARn-IR1 


With preindex (IR1 j subtract 
and modify 


10100 


*ARn++(IR1) 


addr = ARn 
ARn = ARn + IR1 


With postindex(IRI) add 
and modify 


10101 


*ARn--(IR1) 


addr = ARn 
ARn = ARn - IR1 


With postindex (IR1) subtract 
and modify 


10110 


*ARn++(IR1)% 


addr = ARn 

ARn = circ(ARn + IR1) 


With postindex (IR1) add 
and circular modify 


10111 


*ARn — (IR1)% 


addr = ARn 

ARn = circ(ARn-IR1) 


With postindex (IR1) subtract 
and circular modify 


Indirect Addressing (Special Cases) 


11000 


*ARn 


addr = ARn 


Indirect 


11001 


*ARn ++ (IRO)B 


addr = ARn 

ARn = B(ARn + IRO) 


With postindex (IRO) add 
and bit-reversed modify 



LEGEND: 

addr = memory address 

ARn = auxiliary register ARO - AR7 

IRn = index register IRO or IR1 

disp = displacement 

++ = add and modify 

— = subtract and modify 

circ( ) = address in circular addressing 

% = where circular addressing is performed 

B = where bit-reversed addressing is performed 
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F.5.2 Summary 

Table F-6. Instruction Set Summary 



Mnemonic 


Description 


Operation 


ABSF 


Absolute value of a floating-point number 


|src| — > Rn 


ABSI 


Absolute value of an integer 


|src| — > Dreg 


ADDC 


Add integers with carry 


src + Dreg + C — > Dreg 


ADDC3 


Add integers with carry (3-operand) 


srd + src2 + C — » Dreg 


ADDF 


Add floating-point values 


src + Rn — > Rn 


ADDF3 


Add floating-point values (3-operand) 


srd + src2 — > Rn 


ADDI 


Add integers 


src + Dreg — > Dreg 


ADDI3 


Add integers (3-operand) 


srd + src2 + — > Dreg 


AND 


Bitwise logical-AND 


Dreg AND src — > Dreg 


AND3 


Bitwise logical-AND (3-operand) 


srd AND src2 — > Dreg 


ANDN 


Bitwise logical-AND with complement 


Dreg AND src— > Dreg 


ANDN3 


Bitwise logical-ANDN (3-operand) 


srd AND src2-> Dreg 


ASH 


Arithmetic shift 


If count > 0: 

(Shifted Dreg left by count) — > Dreg 
Else: 

(Shifted Dreg right by |count|) — » Dreg 


ASH3 


Arithmetic shift (3-operand) 


If count > 0: 

(Shifted src left by count) — > Dreg 
Else: 

(Shifted src right by |count|) — > Dreg 


Bcond 


Branch conditionally (standard) 


If cond = true: 
If Csrc is a register, Csrc — -> PC 
If Csrc is a value, Csrc + PC — > PC 

Else, PC + 1 -» PC 


BcondD 


Branch conditionally (delayed) 


If cond = true: 
If Csrc is a register, Csrc — » PC 
If Csrc is a value, Csrc + PC + 3 -> PC 

Else, PC + 1 -» PC 



LEGEND: 

src general addressing modes 

srd three-operand addressing modes 

src2 three-operand addressing modes 

Csrc conditional-branch addressing modes 

Sreg register address (any register) 

count shift value (general addressing modes) 

SP stack pointer 

GIE global interrupt enable register 

RM repeat mode bit 

TOS top of stack 



Dreg register address (any register) 

Rn register address (R7 — R0) 

Daddr destination memory address 

ARn auxiliary register n (AR7 — ARO) 

addr 24-bit immediate address (label) 

cond condition code (see Chapter 1 1 ) 

ST status register 

RE repeat interrupt register 

RS repeat start register 

PC program counter 

C carry bit 
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Table F-6. Instruction Set Summary (Continued) 



Mnemonic 


Description 


Operation 


BR 


Branch unconditionally (standard) 


Value — > PC 


BRD 


Branch unconditionally (delayed) 


Value -» PC 


CALL 


Call subroutine 


PC + 1 -> TOS 
Value -» PC 


CALLcond 


Call subroutine conditionally 


If cond = true: 
PC + 1 -» TOS 

If Csrc is a register, Csrc — > PC 
If Csrc is a value, Csrc + PC — > PC 

Else, PC + 1 -» PC 


CMPF 


Compare floating-point values 


Set flags on Rn-src 


CMPF3 


Compare floating-point values 
(3-operand) 


Set flags on srd - src2 


CMPI 


Compare integers 


Set flags on Dreg - src 


CMP 13 


Compare integers (3-operand) 


Set flags on srd - src2 


DBcond 


Decrement and branch conditionally 
(standard) 


ARn - 1 -» ARn 

If cond = true and ARn > 0: 
If Csrc is a register, Csrc — > PC 
If Csrc is a value, Csrc + PC + 1 — > PC 

Else, PC + 1 ->PC 


DBcondD 


Decrement and branch conditionally 
(delayed) 


ARn-1 -> ARn 

If cond = true and ARn > 0: 
If Csrc is a register, Csrc — > PC 
If Csrc is a value, Csrc + PC + 3 — > PC 

Else, PC + 1 -> PC 


FIX 


Convert floating-point value to integer 


Fix (src) — > Dreg 


FLOAT 


Convert integer to floating-point value 


Float(src) — » Rn 


IACK 


Interrupt acknowledge 


Dummy read of src 

IACK toggled low, then high 


IDLE 


Idle until interrupt 


PC + 1 -> PC 

Idle until next interrupt 


LDE 


Load floating-point exponent 


src(exponent) — > Rn(exponent) 


LDF 


Load floating-point value 


src — > Rn 


LDFcond 


Load floating-point value conditionally 


If cond = true, src — > Rn 
Else, Rn is not changed 


LDFI 


Load floating-point value, interlocked 


Signal interlocked operation src — » Rn 


LDI 


Load integer 


src — > Dreg 


LDI cond 


Load integer conditionally 


If cond = true, src — ■> Dreg 
Else, Dreg is not changed 
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Table F-6. Instruction Set Summary (Continued) 



Mnemonic 


Description 


Operation 


LDII 


Load integer, interlocked 


Signal interlocked operation src — » Dreg 


LDM 


Load floating-point mantissa 


src (mantissa) — > Rn (mantissa) 


LSH 


Logical shift 


If count >0: 

(Dreg left-shifted by count) — > Dreg 
Else: 

(Dreg right-shifted by |count|) — > Dreg 


LSH3 


Logical shift (3-operand) 


If count > 0: 

(src left-shifted by count) — > Dreg 
Else: 

(src right-shifted by |count|) — > Dreg 


MPYF 


Multiply floating-point values 


src X Rn — > Rn 


MPYF3 


Multiply floating-point value (3-operand) 


srd X src2 -» Rn 


MPYI 


Multiply integers 


src X Dreg — > Dreg 


MPYI3 


Multiply integers (3-operand) 


srd x src2 — > Dreg 


NEGB 


Negate integer with borrow 


- src - C -> Dreg 


NEGF 


Negate floating-point value 


- src — > Rn 


NEGI 


Negate integer 


- src — » Dreg 


NOP 


No operation 


Modify ARn if specified 


NORM 


Normalize floating-point value 


Normalize (src) — > Rn 


NOT 


Bitwise logical-complement 


src — » Dreg 


OR 


Bitwise logical-OR 


Dreg OR src — > Dreg 


OR3 


Bitwise logical-OR (3-operand) 


srd OR src2 — » Dreg 


POP 


Pop integer from stack 


*SP >Dreg 


POPF 


Pop floating-point value from stack 


*SP >Rn 


PUSH 


Push integer on stack 


Sreg -» *++ SP 



LEGEND: 

src general addressing modes 

srd three-operand addressing modes 

src2 three-operand addressing modes 

Csrc conditional-branch addressing modes 

Sreg register address (any register) 

count shift value (general addressing modes) 

SP stack pointer 

GIE global interrupt enable register 

RM repeat mode bit 

TOS top of stack 



Dreg register address (any register) 

Rn register address (R7 — R0) 

Daddr destination memory address 

ARn auxiliary register n (AR7 — ARO) 

addr 24-bit immediate address (label) 

cond condition code (see Chapter 1 1 ) 

ST status register 

RE repeat interrupt register 

RS repeat start register 

PC program counter 

C carry bit 
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Table F-6. Instruction Set Summary (Continued) 



Mnemonic 


Description 


Operation 


PUSHF 


Push floating-point value on stack 


Rn— > *++SP 


REJlcond 


Return from interrupt conditionally 


If cond = true or missing: 

*SP > PC 

1 -> ST (GIE) 
Else, continue 


RET Scond 


Return from subroutine conditionally 


If cond = true or missing: 

*SP > PC 

Else, continue 


RND 


Round floating-point value 


Round (src) — > Rn 


ROL 


Rotate left 


Dreg rotated left 1 bit — * Dreg 


ROLC 


Rotate left through carry 


Dreg rotated left 1 bit through carry — » Dreg 


ROR 


Rotate right 


Dreg rotated right 1 bit — > Dreg 


RORC 


Rotate right through carry 


Dreg rotated right 1 bit through carry — > 
Dreg 


RPTB 


Repeat block of instructions 


src -> RE 
1 -> ST (RM) 
Next PC -> RS 


RPTS 


Repeat single instruction 


src -H> RC 
1 -> ST (RM) 
Next PC -> RS 
Next PC -> RE 


SIGI 


Signal, interlocked 


Signal interlocked operation 
Wait for interlock acknowledge 
Clear interlock 


STF 


Store floating-point value 


Rn -» Daddr 


STFI 


Store floating-point value, interlocked 


Rn -> Daddr 

Signal end of interlocked operation 


STI 


Store integer 


Sreg -> Daddr 


STII 


Store integer, interlocked 


Sreg -> Daddr 

Signal end of interlocked operation 


SUBB 


Subtract integers with borrow 


Dreg - src - C — > Dreg 


SUBB3 


Subtract integers with borrow (3-operand) 


srd - src2 - C — > Dreg 


SUBC ' 


Subtract integers conditionally 


If Dreg - src > 0: 

[(Dreg - src) « 1] OR 1 — > Dreg 

Else, Dreg « 1 — > Dreg 
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Table F-6. Instruction Set Summary (Concluded) 



Mnemonic 


Description 


Operation 


SUBF 


Subtract floating-point values 


Rn-src — » Rn 


SUBF3 


Subtract floating-point values (3-operand) 


srd - src2 — > Rn 


SUBI 


Subtract integers 


Dreg - src — > Dreg 


SUBI3 


Subtract integers (3-operand) 


srd - src2 — •> Dreg 


SUBRB 


Subtract reverse integer with borrow 


src - Dreg - C — > Dreg 


SUBRF 


Subtract reverse floating-point value 


src- Rn — > Rn 


SUBRI 


Subtract reverse integer 


src - Dreg — > Dreg 


SWI 


Software interrupt 


Perform emulator interrupt sequence 


TRAPcond 


Trap conditionally 


If cond = true or missing: 

Next PC -» * ++ SP 

Trap vector N — > PC 

-» ST (GlE) 
Else, continue 


TSTB 


Test bit fields 


Dreg AND src 


TSTB3 


Test bit fields (3-operand) 


srd AND src2 


XOR 


Bitwise exclusive-OR 


Dreg XOR src — > Dreg 


XOR3 


Bitwise exclusive-OR (3-operand) 


srd XOR src2 — > Dreg 



LEGEND: 






src 


general addressing modes 


Dreg 


srd 


three-operand addressing modes 


Rn 


src2 


three-operand addressing modes 


Daddr 


Csrc 


conditional-branch addressing modes 


ARn 


Sreg 


register address (any register) 


addr 


count 


shift value (general addressing modes) 


cond 


SP 


stack pointer 


ST 


GlE 


global interrupt enable register 


RE 


RM 


repeat mode bit 


RS 


TOS 


top of stack 


PC 
C 



register address (any register) 
register address (R7 — RO) 
destination memory address 
auxiliary register n (AR7 — ARO) 
24-bit immediate address (label) 
condition code (see Chapter 11) 
status register 
repeat interrupt register 
repeat start register 
program counter 
carry bit 
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Table F-7. Parallel Instruction Set Summary 



Mnemonic 


Description 


Operation 


Parallel Arithmetic With Store Instructions 


ABSF 
IISTF 


Absolute value of a floating-point 


|src2| —> dstl 
|| src3— >dst2 


ABSI 
||STI 


Absolute value of an integer 


|src2| -> dstl 
|| src3 -> dst2 


ADDF3 
IISTF 


Add floating-point 


srd + src2 — > dstl 
||src3— >dst2 


ADDI3 
IISTI 


Add integer 


srd + src2 — > dstl 
|| src3 — >dst2 


AND3 
IISTI 


Bitwise logical-AND 


srd AND src2 — > dstl 
|| src3 —> dst2 


ASH3 
IISTI 


Arithmetic shift 


If count >0: 

src2 « count — > dstl 
|| src3 — > dst2 
Else: 

src2 » |count| — > dstl 
|| src3 -» dst2 


FIX 

IISTI 


Convert floating-point to integer 


Fix(src2) — > dstl 
|| src3 — >dst2 


FLOAT 
IISTF 


Convert integer to floating-point 


Float(src2) -> dstl 
|| src3 — > dst2 


LDF 
IISTF 


Load floating-point 


src2 — > dstl 
|| src3 — > dst2 


LDI 
IISTI 


Load integer 


src2 — > dstl 
|| src3 — > dst2 


LSH3 

IISTI 


Logical shift 


If count >0: 

src2 « count — > dstl 
|| src3 — > dst2 
Else: 

src2 » |count| — > dstl 
|| src3 — >dst2 


MPYF3 
II STF 


Multiply floating-point 


srd x src2 — > dstl 
|| src3 — > dst2 


MPYI3 
II STI 


Multiply integer 


srd x src2 — > dstl 
|| src3 — > dst2 


NEGF 
IISTF 


Negate floating-point 


0- src2 — > dstl 
|| src3 — > dst2 
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Instruction Set 



Table F-7. Parallel Instruction Set Summary (Concluded) 



Mnemonic 


Description , 


Operation 


Parallel Arithmetic With Store Instructions (Concluded) 


NEGI 
||STI 


Negate integer 


- src2 — > dstl 
|| src3 -> dst2 


NOT 
||STI 


Complement 


srd — > dstl 
|| src3 — > dst2 


OR3 
||STI 


Bitwise logical-OR 


srd OR src2 — > dstl 
|| src3 — >dst2 


STF 
. || STF 


Store floating-point 


srd — > dstl 
|| src3 — > dst2 


STI 

||STI 


Store integer 


srd — > dstl 
|| src3 — » dst2 


SUBF3 

|| STF 


Subtract floating-point 


srd - src2 — > dstl 
|| src3 — > dst2 


SUBI3 
II STI 


Subtract integer 


srd - src2 — > dstl 
|| src3 — > dst2 


XOR3 

|| STI 


Bitwise exclusive-OR 


srd XOR src2 -» dstl 
|| src3 -> dst2 


Parallel Load Instructions 


LDF 
II LDF 


Load floating-point 


src2 -> dstl 
|| src4 — > dst2 


LDI 
||LDI 


Load integer 


src2 — > dstl 
|| src4 — > dst2 


Parallel Multiply And Add/Subtract Instructions 


MPYF3 
|| ADDF3 


Multiply and add floating-point 


op1 x op2 — > op3 
|| op4 + op5 — > op6 


MPYF3 
|| SUBF3 


Multiply and subtract floating-point 


op1 x op2 — > op3 
|| op4 - op5 —> op6 


MPYI3 
|| ADDI3 


Multiply and add integer 


op1 x op2 — > op3 
|| op4 + op5— ■» op6 


MPYI3 
|| SUBI3 


Multiply and subtract integer 


op1 x op2 — > op3 
|| op4-op5— >op6 



LEGEND: 
srd 
src3 
dstl 
op3 



register addr (R7 — RO) 
register addr (R7 — RO) 
register addr (R7 — RO) 
register addr (RO or R1) 



src2 indirect addr (disp = 0,1, IRO, IR1 ) 

src4 indirect addr (disp = 0,1, IRO, IR1 ) 

dst2 indirect addr (disp = 0, 1 , IRO, IR1 ) 

op6 register addr (R2 or R3) 



op1 ,op2,op4,op5 - Two of these operands must be specified using register addr, and two must be specified 
using indirect. 
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A-law compression, 11-51 
A-law expansion, 11-52 
adaptive filters, 11-61 
addition example, 11-35 
addresses 

A23-A0 signals 

additional capacitance, 12-38 
timing requirements, 12-38 

XA12-XA0 signals 

additional capacitance, 12-38 
timing requirements, 12-38 
addressing modes 

conditional branch, 2-14, 5-23 

general, 5-19 

long-immediate, 2-14, 5-22 

parallel, 2-14, 5-21 

three operand, 2-14, 5-20 
addressing types, 5-2 

direct addressing, 5-4 

immediate, 5-17 

indirect addressing, 5-5 — 5-16 

PC relative, 5-18 

register, 5-3, F-6 

analysis subsystem 

pod dimensions, 12-40 

use with debugging tools, 12-37 
applications 

general listing, 1-7 

hardware, 12-1 

software, 11-1 

architecture, 2-1 

archiver, B-3 

arithmetic logic unit (ALU), 2-5 

arithmetic operations, 11-21 

assembler, B-3 



assembly language instructions, 1 0-1 
categories, 10-3—10-8 

interlocked operation, 10-6 

load and store, 10-3 

parallel operation, 10-6 

program control, 10-5 

three-operand, 10-5 

two-operand, 10-4 
coding hints, 11-88 
condition codes, flags, 10-9 
condition for execution, 10-9 
example instruction, 10-16 
opcodes, A-1 
register syntax, 10-15 
symbols used to define, 10-12 — 10-15 
syntax options, 1 0-13 — 1 0-15 
auxiliary (AR0-AR7) registers, 3-4 
auxiliary register ALUs, 2-5 



bank switching, external bus, 7-29, 12-12 

bit manipulation, 11-21 

bit-reversed addressing, 5-29, 11-23 

block move, 11-23 

block repeat register (RS.RE), 3-11 

block size (BK) register, 3-5 

branches, 6-7 

delayed, 6-7, 11-15 
bulletin board, B-1 3 
bus operation 

external, 2-23 

internal, 2-22 
busy-waiting example, 6-12 



C(HLL) routines, 11-88 
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Index 



C compiler 

features, B-3 

hosts supported, B-5 

optimization, B-4 
cache 

hit, 3-20 

miss, 3-21 
cache memory, 2-9, 3-19 

See also memory 

algorithm, 3-20 

architecture, 3-19 

control bits, 3-22 

instruction cache, 3-19 
calls, 6-8 

capacitance of electrical probe, 1 2-38 
cautions, viii 

central processing unit, 2-3 
circular addressing, 5-24 
CLKR pins, 8-18, F-28 
CLKX pins, 8-17, F-27 
clock oscillator circuitry, 12-25 
clocks 

additional capacitance, 12-38 

H1, 12-38, 12-39 

H3, 12-38, 12-39 

timing requirements, 12-39 

companding, 11-48 

compression 
A-law, 11-51 

u.-law, 11-49 

condition codes, flags, 10-11 

conditional delayed branches, 6-7 

conditional-branch addressing modes, 2-14, 5-23 

context switching, 11-11 

conversion 

floating point to integer, 4-22 

integer to floating point, 4-24 
counter example, 6-12 
counter register (timer), 8-2, 8-6 
CPU, 2-3 

CPU interrupt flag register (IF), interrupt flag (IF), 

3-9 
CPU registers, 2-6 

auxiliary (AR0-AR7), 2-7, 3-4 

block repeat (RS, RE), 3-11 

block size (BK), 2-7, 3-5 



CPU/DMA interrupt enable (IE), 3-8 
data page pointer, 2-7 
data page pointer (DP), 3-5 
extended precision (R0-R7), 2-6, 3-4 
I/O flags (IOF), 2-7, 3-10 
index (IR1.IR0), 2-7, 3-5 
interrupt enable (IE), 2-7, 3-8 
interrupt flag (IF), 2-7, 3-9 
list of, 3-3 

program counter (PC), 2-8, 2-22, 3-11 
repeat count (RC), 2-8, 6-2 
repeat count (RP), 3-11 
repeat end address (RE), 2-8, 3-11 , 6-2 
repeat start address (RS), 2-8, 3-11 , 6-2 
reserved bits, 3-11 
status register (ST), 2-7, 3-5, 10-10 
system stack pointer (SP), 2-7, 3-5 
CPU1/2 buses, 2-22 



□ 



data 

D31-D0 signals 

additional capacitance, 12-38 
timing requirements, 12-38 
XD31-XD0 signals 

additional capacitance, 12-38 
timing requirements, 12-38 
data page pointer (DP) register, 3-5 
data receive register (serial port), 8-22 
data transmit register (serial port), 8-21 
delayed branches, 6-7, 11-15 
advantages, 11-89 
incorrectly placed, 6-6, 6-7 
design considerations, 12-37 — 12-42 
electrical information 

capacitance of electrical probe, 12-38 
power supply loading of active probe, 12-38 
mechanical dimensions, 12-40 
timing information, Pages, 12-38 
dimensions (chip), 13-15 
direct addressing, 5-4 
disabled interrupts by branch, 6-7 
displacements, 5-5 — 5-16 
division, 11-24 
DMA 

architecture, 2-26 

buses, 2-22 

channel synchronization, 8-49 — 8-51 
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controller, 8-38 

destination/source address register, 8-42 

general, 2-26 

global control register, 8-39 

interrupt enable register, 8-42 

memory transfer, 8-44 — 8-48 

synchronization of channels, 8-49 — 8-51 

transfer counter register, 8-42 

DMA global-control register format, F-18 
DMA interrupt enable register (IE), 3-2 
documentation, v, B-13 
DRpins, 8-18, F-28 
DX pins, 8-17, F-27 



edge-triggered interrupts, 6-20 

electrical characteristics, 13-18 

electrical information 

capacitance of electrical probe, 12-38 
power supply loading of active probe, 12-38 

electrical specifications, 13-17 

EMU0-EMU6 signals 

additional capacitance, 12-38 
timing requirements, 12-39 

emulator, in system overview, 12-37 

event counters, 8-2 

expansion 

A-law, 11-52 

u.-law, 11-50 
expansion bus 

See also external buses 

addresses, 12-38 

data, 1 2-38 

reads/writes, 12-39 

signal timing, 12-38 

extended precision 

addition example, 11-35 
floating-point format, 4-6 
multiplication example, 11-36 
subtract example, 11-35 

external buses (expansion, primary), 2-23, 7-1 
bank switching, 7-29, 12-12 
expansion bus control register, 7-4 
expansion bus I/O cycles, 7-10 — 7-26 
expansion bus interface, 12-18 



external interrupts, 2-23 
interlocked instructions, 2-23 
primary bus control register, 7-3 
primary bus interface, 12-4 
ready generation, 12-8 
timing 

expansion bus, 7-10—7-26 
primary bus, 7-5 — 7-9 
wait states, 7-27, 12-4, 12-8 

external devices, 12-3 

external interfaces, 12-2 



fast Fourier transforms, 11-66 

FFT, 11-66 

filters 

adaptive, 11-61 

FIR, 11-53 

IIR, 11-55 

lattice, 11-82 

LMS algorithm, 11-61 

FIR filters, 11-53 
floating point 

addition, 4-14 

conversion to integer, 4-22 

division, 11-24 

format conversion, 4-8 

formats, 4-4 

IEEEtoTMS320, 11-38 

inverse, 11-27 

multiplication, 4-10 

normalization, 4-18 

normalized, 4-14 

rounding value, 4-20 

short format, 4-4 

single-precision format, 4-6 

square root example, 11-30 

subtraction, 4-14 

TMS320 to IEEE, 11-38 

underflow, 4-15 

formats 

conversion, 4-8 
floating point, 4-4 
signed integer, 4-2 
unsigned integer, 4-3 

FSRpins, 8-18, F-28 

FSXpins, 8-17, F-27 
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general addressing modes, 2-14, 5-19 
global control register (serial port), 8-14 
global control register (timer), 8-3 
global memory, 6-10, 6-13 



H1 signal 

additional capacitance, 12-38 
timing requirements, 12-39 

H3 signal 

additional capacitance, 12-38 
timing requirements, 12-39 

hardware applications, 12-1 

hardware control, 6-1 

header (XDS1 000), 12-34 

HOLDA, signal 

additional capacitance, 12-38 
timing requirements, 12-39 

hotline, B-13 



I/O flags register (IOF), 3-10 

IACK, signal 

additional capacitance, 12-38 
timing requirements, 12-39 

MR filters, 11-55 

immediate addressing, 5-17 

index (IR0.IR1) register, 3-5 

indirect addressing, 5-5 

initialization (processsor), 11-3 

instruction cache, 3-19 

instruction register (IR), 2-22 

instruction set summary 
alphabetical, 2-14 
function listing, 10-3 

instructions. See assembly language 

TNT0-|NT3,3-17, F-14 

TNT0-TNT3, signals 

additional capacitance, 12-38 
timing requirements, 12-39 

integer division, 11-24 

integer formats 



short integer, 4-2 

signed, 4-2 

single-precision integer, 4-2 

unsigned, 4-3 
interfaces 

expansion bus, 2-23, 12-18 

primary bus, 2-23, 12-4 

system control, 12-25 

types, 12-2 
interlocked operations, 6-10 
internal bus, 2-22 
interrupt enable register (IE), 3-8 
interrupt service routines, 11-10 
interrupts, 2-23, 6-20 

context switching, 11-11 

control bits, 6-20 

interrupt service routines, 11-10 

prioritization, 6-25 

prioritizing, 11-14 

processing, 6-27 

serial ports, 8-27 

vectors, 3-1 6, 6-25 
inverse, 11-27 
inverse lattice filter, 11-83 
IOSTRB, signal, 7-2, 7-5 

additional capacitance, 12-38 

timing requirements, 12-40 



lattice filters, 11-82 

level-triggered interrupts, 6-20 

linker, B-3 

literature, v, B-13 

logical operations, 11-21 

long-immediate addressing modes, 2-14, 5-22 

looping, 11-16 

LRU cache update, 3-1 9 



matrix-vector multiplication, 11-64 

mechanical data, 13-15 

mechanical dimensions, 12-40 

memory, 2-9,3-19 

accesses (pipeline), 9-21 
cache, 2-9, 3-19, 11-89 
general organization, 2-9 
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global, 6-10, 6-13 
memory maps, 2-11,3-12 
microcomputer mode, 3-12 
microprocessor mode, 3-12 
pipeline conflicts, 9-9, 9-19 
quick access, 11-89 

microcomputer mode, 2-11,3-12 

u.-law compression, 11-49 

(i-law expansion, 11-50 

microprocessor mode, 2-11, 3-12 

MSTRB, signal, 7-2, 7-5 

additional capacitance, 12-38 
timing requirements, 12-40 

multiple processors, 6-10 

multiplication, 11-64 

multiplier, 2-5 



nested block repeats, 6-6 

normalization, floating-point value, 4-14, 4-18 



object program converter, B-3 
opcodes, A-1 

optimizer (C compiler), B-4 
ordering information, B-15 
overflow, 4-15, 4-22 

overview, emulation system with analysis subsys- 
tem, 12-37 



parallel addressing modes, 2-14, 5-21 
part numbers, B-15 

breakdown of numbers, B-18 
prefix designators, B-1 6 
period register (timer), 8-2, 8-6 
peripheral bus, 2-24, 8-1 
general architecture, 2-24 
map, 3-18 
peripherals on, 8-1 
DMA controller, 8-38 
serial port, 2-25, 8-12 
timers, 2-25, 8-2 
register diagram, 2-24 



pin assignments, 13-4, 13-5 
pin states at reset, 6-1 6 
pipeline, 9-1 

conflicts 

avoiding, 11-89 
branching, 9-4 
memory, 9-9 

memory (resolving), 9-19 
registers, 9-6 

memory accesses, 9-21 

structure, 9-2 
power, supply loading of active probe, 12-38 
primary bus 

See also external buses 

addresses, 12-38 

data, 12-38 

reads/writes, 1 2-39 

signal timing, 12-38 
primary bus control register, 7-3 
probes 

active probe power-supply loading, 12-38 

electrical probe capacitance, 12-38 
program buses, 2-22 
program counter (PC), 2-22, 3-11 
program flow, 6-1 



quality, C-1 
queues (stack), 5-32 



R/W, signal 

additional capacitance, 1 2-38 

timing requirements, 12-39 
RAM, 2-9 

See also memory 
ready generation, 12-8 
receive/transmit timer counter register (serial port), 

8-21 
receive/transmit timer period register (serial port), 

8-21 
regional technology centers, B-1 4 
register buses, 2-22 
registers, 2-6 

auxiliary (AR0-AR7), 2-7, 3-4 

block repeat (RS, RE), 3-11 
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block size (BK), 2-7, 3-5 

counter (timer), 8-6 

CPU/DMA interrupt enable (IE), 3-8, 8-42 

data page pointer, 2-7 

data page pointer (DP), 3-5 

DMA destination and source address, 8-42 

DMA global control register, 8-39 

DMA transfer counter register, 8-42 

DMA/CPU interrupt enable (IE), 8-42 

extended precision (R0-R7), 2-6, 3-4 

global control (timer), 8-3 

I/O flag (IOF), 3-10 

I/O flags (IOF), 2-7 

index (IR1.IR0), 3-5 

interrupt enable (IE), 2-7, 3-8 

interrupt flag (IF), 2-7, 3-9 

maximum use, 11-89 

period (timer), 8-6 ' 

peripheral port, 8-1 

pipeline conflicts, 9-6 

program counter (PC), 2-8, 2-22, 3-11 

repeat count (RC), 2-8, 6-2 

repeat count (RP), 3-11 

repeat end address (RE), 2-8, 3-11 , 6-2 

repeat start address (RS), 2-8, 3-11 , 6-2 

reserved bits, 3-11 

serial port global control, 8-14 

serial port registers, 8-12 — 8-37 

status register (ST), 2-7, 3-5, 10-10 

system stack pointer (SP), 2-7, 3-5 
registers, general, see also CPU registers, 2-5 
reliability, C-1 

stress testing, C-2 
repeat count register (RC), 3-11 , 6-2 
repeat end address register (RE), 3-11 , 6-2 
repeat mode 

initialization, 6-2 

RPTS initialization, 6-3 
repeat modes, 11-16 

repeat start address register (RS), 3-11 , 6-2 
requirements, signal timing, Pages, 12-38 
reset, 3-16, 6-16 

initialization (processor), 11-3 

operations performed, 6-19 

pin states, 6-16 

vectors, 3-16, 6-25 
RESET signal 

additional capacitance, 12-38 

timing requirements, 12-39 



return from subroutine, 6-8 
RINTO.1,3-17, F-14 
ROM, 2-9 

See also memory 
rounding of floating-point value, 4-20 
RTCs, B-14 



scan path interface, B-10 

segment start address (SSA) reg., 3-19 

semaphores, 6-13 

seminars, B-14 

serial port, 8-12— 8-37 

data receive register, 8-22 

data transmit register, 8-21 

global control register, 8-1 4 

interrupt sources, 8-27 

port control register (FSR/DR/CLKR), 8-18, F-28 

port control register (FSX/DX/CLKX), 8-17, F-27 

receive/transmit timer control register, 8-19, F-29 

receive/transmit timer counter register, 8-21 

receive/transmit timer period register, 8-21 

timing, 8-24, 8-28—8-37 
serial port global control register, 8-14 
short floating-point format, 4-4 
signal descriptions, 1 3-9 — 1 3-1 4 
signal transition levels, 13-19 
signals, timing information, Pages, 12-38 
simulator, B-5 
software applications, 11-1 
software control, 6-1 
software development, B-2 

archiver, B-3 

assembler, B-3 

emulator (XDS500 & 1 000), B-8 

linker, B-3 

macro assembler, B-3 

object format converter, B-3 

scan path interface, B-10 

simulator, B-5 
SPOX, B-6 

square root example, 11-30 
stack, 5-30, 5-32, 11-9 

queues, 5-32 
stack pointer (SP) register, 3-5 
status register (ST), 3-5, 10-10 
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STRB, signal, 7-2, 7-5 

additional capacitance, 12-38 

timing requirements, 12-40 
style (manual), vii 
subroutines, 11-7 — 11-20 

computed GOTO, 11-20 

context switching, 11-11 

runtime select, 11-18 
subtract example, 11-35 
symbols (used in manual), vii 
synchronize 2 processors example, 6-15 
system overview, 1 2-37 



target cable, 12-37 

target system, design considerations, 12-37 — 12-42 

electrical information, 12-38 

mechanical dimensions, 12-40 

timing information, Pages, 12-38 
target system connection, 12-34 
TCLKO, TCLK1 , signals 

additional capacitance, 12-38 

timing requirements, 12-39 
test load circuit, 13-18 
third party support, B-12 
three-operand addressing modes, 2-14, 5-20 
timer global control register, 8-3 
timers, 2-25, 8-2— 8-11 

counter, 8-2 

counter register, 8-6 

operation nodes, 8-7 

period register, 8-2, 8-6 
timing parameters, 13-20 — 13-51 
timing/counting, TMS320C30 signal timing, 

Pages, 12-38 
TiNTO.1,3-17, F-14 



trap, 3-16, 6-8 

vectors, 3-16 
TTL levels, 13-19 



underflow, 4-14 



vectors (reset, interrupts), 6-25 
vectors (reset, trap, interrupt), 3-16 



M 



wait states, 7-27 

external bus, 12-4, 12-8 

zero, 12-4 
workshops, B-14 



XDS1000, 12-34, B-8 

XDS500, B-8 

XF0.XF1, signals 

additional capacitance, 1 2-38 
timing requirements, 12-39 

XF0.XF1.2-23 

XINT0.1, 3-17, F-14 

XR/W signal 

additional capacitance, 1 2-38 
timing requirements, 12-39 



zero wait states, 12-4 
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